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Abstract 

We study the interaction of small amplitude, long wavelength solitary waves in the Fermi-Pasta-Ulam 
model with general nearest-neighbor interaction potential. We establish global-in-time existence and sta- 
p ^ ' bility of counter-propagating solitary wave solutions. These solutions arc close to the linear superposition 
of two solitary waves for large positive and negative values of time; for intemediate values of time these 
solutions describe the interaction of two counterpropagating pulses. These solutions are stable with respect 
i-^ ■ to perturbations in I 2 and asymptotically stable with respect to perturbations which decay exponentially at 
& , spatial ±oo. 

1 Introduction 

It has long been of interest in applied math and physics to study how energy and mass are scattered or 
transfered during the collision of two or more coherent objects. We study this question for the Fermi-Pasta- 
£q I Ulam (FPU) problem: 

q j = V(q j+1 -q j )-V'(q j -q j . 1 ) jeZ (1) 

which models an infinite chain of anharmonic oscillators with nearest-neighbor interaction potential V. On 
making the change of variables rj = qj+\ — qj and pj = qj, the state variable u = (r,p) satisfies a system of 
first order Hamiltonian ODEs, 

ut = JH'(u) (2) 
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where the Hamiltonian H is given by 



H{r, P ) = Y J \pl + V{r k ) (3) 

fcez 

The symplectic operator J is given by J = ( ^ ^-l ^ ^ ) where S is the left shift on bi- infinite 

sequences, i.e. {Sx) n := x n +\. The problem is well posed in each £ p space, but for concreteness and 
simplicity we work in £ 2 . 

The restriction of this problem to a finite lattice with Dirichlet boundary conditions was famously studied 
in 1955 on the MANIAC computer in order to determine rates of convergence to equipartition of energy 
[I]. Surprisingly, no such convergence occured. Instead, the authors observed nearly quasiperiodic motion in 
which the energy remained mostly in the first few modes. Later Zabusky and Kruskal [5] rediscovered the 
equation of Korteweg and DcVries (KdV) as a long-wavelength low amplitude limit of FPU and posited a 
connection between nearly recurrent states in FPU and soliton interaction in the KdV equation. The KdV 
equation has gone on to become both important and well understood as an example of a completely intcgrablc 
nonlinear dispersive wave equation. For an excellent review of this productive period sec Miura [3]. 

Less understood is the theory of solitary waves and their interaction in non-integrable dispersive wave 
equations. Martel, Merle, and their collaborators have studied the generalized KdV (gKdV) equation where 
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they have obtained existence and stability of asymptotic multi-soliton states [4], as well as the existence and 
stability of "nearly two-soliton solutions," that is, solutions which remain close to the linear superposition 
of a broad, small, slow-moving solitary wave and a narrow, large, fast-moving solitary wave for all time. So 
long as initially the large wave is to the left of the small wave, the large wave will overtake the small wave in 
finite time. Thus these solutions describe the collision between two solitary waves. Here the solitary waves 
survive the collision despite the fact that the system is not intcgrablc, but unlike the intcgrable case, the 
collision is inelastic - energy is transferred or scattered from the solitary waves to dispersive or radiative 
modes. 

In this paper, we examine the interaction of solitary waves in the FPU model which unlike gKdV admits 
two-way wave motion. In particular, rather than study the analogue of the KdV 2-soliton solution in this 
model, we focus on the interaction of counterpropagating waves. We restrict attention to the "KdV" regime of 
long-wavelength, low-amplitude initial data. In this near-integrable regime, solitary waves and their stability 
are well understood. In particular, see El \W{ QT] and the references therein. Our results are consistent 
with those of Martel and Merle, and the reasonable conjecture that in the near-integrable regime, collisions 
arc nearly clastic. 

Our main result yields an open set of initial data U C £ 2 such that for each uq € U, the solution u of {2} 
with initial data u(0) = uq satisfies 

lim ||u(t) - u c (t) - u c _(t)\\ < Ce-^" 1 Urn ||u(t) - Ug. (t) - Ue_ (*)|| < Ce 7 / 2 "" 

for some constant C which doesn't depend on e and small numbers < 771 < 770 < 1/2. Here u c+ and u c _ 
denote right- and left- moving solitary wave solutions to ([2|) and are order e 3 / 2 in the I 2 norm. The wave 
speeds at t = 00, which we write as c± may be different from the wave speeds at t = —00 which we denote 
here by c ± . This is stated more precisely as Theorem 12. II The interpretation is that before the collision, the 
error is exponentially small compared to the main waves and is due only to the weak interaction of the tails 
of the solitary waves. However, the collision itself transfers or scatters energy from the coherent structures 
u c+ and u c _ to radiative modes and perhaps smaller coherent modes. This scattering manifests in the error 
term for large positive times which is much larger than the error for large negative times, but remains very 
small compared with the size of the main waves. 

The strategy for the proof is to consider three different regimes: 

1. The pre-interaction regime where the solitary waves are well separated and moving towards each other. 

2. The interaction regime where the solitary waves are not well separated. 

3. The post-interaction regime where the solitary waves have already collided and are now well separated 
and are moving away from each other. 

The interaction regime is handled with a finite time energy estimate which is made precise in Theorem l2.3l 
The pre- and post- interaction regimes are handled with a stability result which is made precise in Theorem 
12.21 In order for these theorems to work together to describe the collision of solitary waves, the stability 
theorem must (a) be valid for the perturbations incurred by the finite time approximation and (b) show that 
these perturbations remain small compared to the main solitary waves. However, consider typical stability 
results, and in particular the ones in [6j [TQl |TT] which have the following form: 

There exists a 8 > and C > such that whenever the initial perturbation is smalleiQ than 8q < 8 
then the perturbation will remain smaller than CSq for all time. 

In order to extend the stability results of [SJ [TU1 [H] to the two-soliton case, we must quantify both <5 and 
C above. If 8q were to be less than the error incurred by either the weak interaction or the finite-time error 
estimate, that would present a fundamental mathematical obstruction to extending stability results for a 
single solitary wave to the two-wave context via the method we present here. Also, if C were to be larger 
than the size of the main wave u c+ or u c _ divided by the error incurred by either the weak interaction or the 
finite time error estimate, then although we could prove a theorem which controls the size of the perturbation, 
we would be unable to say that the perturbation remains small compared to the main waves. It turns out 
that in the stability analysis for solitary waves in FPU, the part of the perturbation which is localized near 

1 Here, in the interest of space, we are being intentionally vague about what we mean by "perturbation" and what we mean by 
"small" . This will be made clear later in the paper. 
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the solitary waves is relatively innocuous while any perturbation which is not localized is dangerous. Our 
main stability result, Theorem [221 is stated in the form: 

There are positive numbers Si oc ~ e 3+v , 6 no „i oc ~ e 9 / 2+r ', C; oc ~ 1, and C non i oc ~ e~ 3 / 2 such that 
so long as the (non)localized perturbation is smaller than <5( non )i oc then the perturbation remains 
smaller than Ci oc 8ioc + C no nlocSnonloc for all time. 

This allows us to prove that for e sufficiently small, there are global-in-time solutions which are close to the 
linear superposition of two counter-propagating solitary waves for all positive and negative time. Additionally 
the methods presented in this paper suggest a blueprint for studying the collision of small broad waves in a 
variety of dispersive systems which have two way wave motion. 

We leave open the important problems of whether or not asymptotic multi-soliton states exist, i.e. whether 
or not initial data may be chosen so that the radiation terms decay to as t — > oo, whether or not exact 
multi-soliton states exist, i.e. whether the initial data may be chosen so that the radiation decays to as 
\t\ — > oo, as well as a description of the collision of larger solitary waves. 

2 Main Results 

We assume that the potential V satisfies 

• HI FeC* 4 V(Q) = V'(0) = V"(0) = 1 V"'(0) = 1. 

We remark that while requiring that V"(0) and V"'(0) both equal 1 may appear restrictive, in fact it is 
not. To see this, suppose V(x) = \ax 2 + \bx 3 + 0(x A ) is a general potential with V(0) = V'(0) = and 
V"(0) > and ^"'(0) ^ 0. Write Newton's equations for a chain of coupled oscillators with interaction 
potential V: 

q n = V'(q n - q n -\) ~ V'(q n+ i - q n ). 

Now make the change of variables r n (t) = a(q n {(3t) — q n -i((3t)) and p n (t) = af3q n ((3t). Then u = (r, p) solves 
(J2J with V given by V(x) = \a(3 2 x + \b{f3 2 /a) + 0{x 4 ). Choosing (3 = a^ 1 ! 2 and a = b/a gives us HI. 

It was proven in [71 [51 [51 [TU] that under the hypothesis HI, there is a c upper > 1 and a family of solitary 
waves u c parameterized by wave speed c € (1, c upper ] for which u(k, t) = u c (k — ct — 7) is a solution for any 
7 G K and moreover the following hold 

• PI The profile r c takes values in an open interval /, Cl containing such that V"(r) > whenever 
r G I*. Moreover r c is even and decays at the exponential rate exp(«;(c)|x|) where k{c) is the unique 
positive root of sinh^K/ = c. 

• P2 For each ci ower G (1, c upper ) and for each a G (0, K,(ci ower )/2), the map 
(t, c) i-> e a 'u c (- - t) is C 1 from R x (c iotoer , c upper ) into £ 2 . 

• P3 The energy of the solitary wave H(u c ) satisfies = e J R 4>j + C(e 3 ) where is the KdV 
soliton. In particular dH ^ c ^ 7^ for e sufficiently small. 

Here PI - P3 were proven in [7j. Note that due to the time reversability of the equation (J^J), the properties 
PI — P3 also hold for c G [~c upper , —1). In our application of P2 we require the map (r, c) i— > e 2a u c (- — r) 
to be C 3 . We show that it has this additional smoothness in the proof of Lemma 13.21 

In the remainder of the paper we assume HI. Let c upper be as in PI - P3, choose ci ower G (1, c upper ) and 
fix a — a(ci ower ) so that PI - P3 hold whenever ±c G (ci ower , c upper ). Throughout the paper K and C refer 
to generic constants. 

Before we may precisely state our main results, we need to develop the weighted function spaces which 
we will work in. Let £ 2 be the Hilbert space of two-sided real-valued sequences {xk}k& which satisfy 
ll x l| 2 := Sfcez x k < 00 • Denote by £ 2 the weighted Hilbert space of two-sided real-valued sequences {xfc}fcez 
which satisfy ||x|| 2 := X^feez e^x 2 . < 00. Given real valued functions of a real variable r+(t) and r_(t) define 

Ikllr+Ct) : = e- aT +(')||a;|| Q = (Efc G z e 2a ( k - T +W x\) 5 . Note that the weight e ak in the weighted norm || • || r+(t) 
travels in the frame of a wave u c+ (- — r + (t)). Similarly, we define |a;|| T _(t) := e aT -(*'||:r||_ a . In the following 
the function r Q is allowed to vary depending on the context. For example, in pip we take r Q = t* for 
a G {+, — } and in the statement of Theorem 12.31 we take r a = a(c a t — T ). Aside from these two theorems, 
r a will denote the phase of a solitary wave which is allowed to vary in a way fixed by the modulation equations 
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in section 3. In the following we will abuse notation, writing ||x(t)|| a to denote ||x(i)||T Q (t)- We also write 
x+ and X- to be the spatially right and left parts of x respectively. That is x+(k) = x(k) when k > and 
x+(k) = otherwise; similarly x-(k) := x{k) — x+(k). 
We may now state precisely our main result. 

Theorem 2.1. Assume (HI). For each < rji < rj < 1/2, there is an e > such that for each < e < Eq 
there is a constant C > and an open set U C £ 2 such that for each uq € U there are smooth real valued 
functions of a real variable c+, c_ 7 r + , and r_ such that if u denotes the solution of ([2]) with u(0) = uq, then 
its perturbation from the sum of two solitary waves 

v{t) := u{-,t) - u c+(t) (- - r+(t)) - u c _( t )(- - r-(t)) 

satisfies 

\\v{t)\\ < Ce-^ V1 t<0 (4) 

and 

\\v(t)\\ < Cs 7/2 " 2, '° t>0. (5) 
Moreover, the weighted perturbation decays: 

lim V ( e 2«(fc-r+(t)) + e -2a(fc-T-(t))) v(jfc t) 3 = nm V- (e -2a(fe-r + (t)) + e 2a(fe-r_( t )h - ^ fl2 = g (g) 

feez feez 
Furthermore, the limits 

lim (c + (t),C-{t),T+(t),t-(t)) and lim (c + (i),c_(£),f + (t),f_(t)) 

t — ^oo r — > — oo 

exist with lim^ioo f a {t) — c a (t) = for a G {+, — }. 

In addition, there is a subset ofU which lies in the weighted space Uq C £ 2 r\£ 2 _ a nU and which is open in 
l\ fl £_ a such that whenever uq £ Uq the weighted perturbations decay at an exponential rate. That is, there 
is a b > such that the solution u of ^ with u(0) = uq satisfies 

^ (e 2a(fe-r + (t)) +e -2a(fc-T_(t)) )w(M) 2 < C& -U t>Q (?) 

fcez 

and 

^( e -Mk-r+m + ^(k-r-(t))) v (ls )t ) 2 < Ce U t<0 (8) 
fcgZ 

Theorem 12.11 asserts the existence of solutions which remain close to the linear superposition of two 
counter-propagating solitary waves forever, both forward and backward in time. In particular, solitary waves 
persist after collision. Moreover, the energy which is transfercd from the solitary waves to radiative modes or 
smaller solitary waves is small relative to the main waves, whose £ 2 norms are C(e 3//2 ). Furthermore, these 
counter-propagating wave solutions are stable with respect to perturbations in £ 2 . In addition, solutions which 
differ from the sum of two counter-propagating solitary waves by a sufficiently small exponentially localized 
perturbation at some initial time, continue to differ from the sum of solitary waves by a small exponentially 
localized amount, both forward and backward in time. 

In our proof we combine an orbital stability result for weakly interacting waves with a finite time ap- 
proximation result for strongly interacting waves. There is a tradeoff here between the exponential control 
of the radiation at t < and the polynomial control over the radiation for t > 0. This tradeoff is a result of 
choosing the time period over which one uses the finite time approximation. If one uses a relatively short time 
period, so that the waves just have time to pass through each other, then the polynomial error incurred by 
the approximation is smaller. This corresponds to a small value of 770 above. However, this has consequences 
for the bounds in the orbital stability result because if the waves start out relatively close to each other, then 
the cross terms which come from the interaction will be larger. This corresponds to a small value of r\\ above. 
On the other hand, if one uses finite time approximations for a larger time period, then the error incurred 
by the approximation will be larger, but the interaction terms before and after this period will be smaller, 
corresponding to relatively large values of r\\ and t\q. 

The orbital stability and finite time approximation results are of independent interest and we now state 
them as separate theorems. 
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Theorem 2.2. Assume (HI). For each rji > there are positive constants C, Si oc , 5 non i oc , and Eq such that 
for any e G (0,£o) the following hold. Suppose that there is a decomposition 

uo(-) - u c * + (- - t* ) - u c -*_ (• - r* ) = v w + v 2a 

such that \\v w \\ < S nordoc and ||(u 2 o)||+ + \\(v 2 a)-\\- + |K'2oj| < S toc with t* < -Ce' 1 '^ < Ce -1- '? < r*, 
finonioc < Ce 9 / 2+r> i an d § [oc < Cs s+ri . TVien i/iere are smooth real valued functions of a positive real variable 
c+, C-, t + , and t_ such that the solution u o/@ with u(0) = Uq satisfies 

IK*. •) - u c+{t) (- - r + (t)) - u c _ (t) (- - r_(i))|| < Ce- z ' 2 5 nonloc + CS loc + Ce-^ 1 ' for all t > (9) 
and 

lim ||ti(t,.)-« c+ (t)(--r + (t))-« c _ W (--T_(t))|| a = (10) 

Moreover, the limit lim t ^ 00 (c + (t), c_ (i), r + (t), f_(t)) exists with lim^oo T a (t) = c a (t) for a 6 {+,—}. 
Furthermore, if t>io = 0, £/ien 

lk(t)a||a<Ce- bt a £{+,-} (11) 

where in the small £ regime b ~ £ 3 . 

Note that we are fairly quantitative in determining the small e asymptotics for the initial data whose 
evolution we are able to control as well as for the degree to which we are able to control it. This is an 
essential feature of our method as the stability result must be able to tolerate the error incurred by the 
following finite time estimate. 

Theorem 2.3. Assume HI. For any < 771 < 770 < 1, there exist positive constants Eq, Co, G\, and Eq 
such that if c_ = —1 — -^e 2 /?_ and c + = 1 + -^e 2 /3 + , with \f3±\ < 1 and < e < £0 and < E < E , then 
whenever u is a solution of (|5J) with 

||u(0) - u c+ {- - 7+) - u c _ (• - r_)|| < e u ' 2 -^E 

for some rj > 0, then the difference between the solution of ([2]) with initial condition u(0) and the linear 
superposition of waves has a decomposition 

u(t) - u c+ (- - c + t - 7+) - u c _ (• - c-t - r_) = + v 2 (t) 

such that 

\\vi(t)\\ <C e n / 2 - 3 ^(E + l) 

and 

\\v2(t)\\ + \\v 2 (t)\\+ + IM*)||- < C e 7 ^'(E + 1) 
hold for < t < Coe -1- '' 1 . If in addition, 

||u(0) - «*,(■ - r+) - Uc _(. - r_)|| + + ||«(0) - u c+ (- - 7+) - Uc _(- - r_)||_ < C l£ 7 ^' 

then we may take v\ = 0. 

The proof of Theorem 3 follows from a straightforward, albeit nonstandard, energy estimate. The following 
lemma will help to streamline the proof. 

Lemma 2.4. Consider the nonlinear inhomogeneous discrete wave equation 

( r = (S-I)p 

\ p={I-S- 1 )r + g(t,r,p) 

Let To € K and a ~ e be given and let c a be given such that ac a — 1 ~ e 2 for a 6 {+,—}. Let || • || denote the 
standard t 2 norm and let \\ ■ \\ a denote the weighted norm with phase r a = c a t — olTq. 

Assume that there are positive constants Co, C\, and r/o € (0, 1) which do not depend on e such that for 
e sufficiently small we have ||<7(£, r, p)\\ < Co£ 1+r?0 for < t < Cie -1- ' 70 Then 

Ht)\\ 2 + \\p{t)\\ 2 < e CoCl (||r(0)|| 2 + ||p(0)|| 2 + 1) < t < de" 1 -" 

Similarly, if \\g(t, r, p)\\ a < Co£ 1+r, ° for < t < Ci£~ 1 ~ ??0 then for e sufficiently small 

\Ht)\\l + lb(*)lla < e 2CoCl (IIK0)|| 2 + b(0)|| 2 + 1) < t < de- 1 -™ 
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Proof. We consider first the unweighted £ 2 norm. Let £(t) := \\r(i)\\ 2 + \\p(t)\\ 2 . Compute 
j t £{t) = (r, (S - I)p) + (p, (I - Or) + ( P ,g) = (p,g) < \\g\\(£ + 1). 

In the second equality we have used the fact that S* = S^ 1 when regarded as an operator on £ 2 . In the last 
inequality we have used the Cauchy- Schwartz inequality as well as the fact that x < x 2 + 1 to replace ||p|| 
with £ + 1. From Gronwall's inequality and the smallncss condition on g we obtain 

£(t) < e CoCl £(Q) + e CoCl - 1 < t < Cis" 1 "" 

as desired. 

Now consider the weighted norm. Let £ a (t) := \\r(t)\\ 2 a + Wplt)^ and let (x,y) a := 2 fceZ e 00 ^ -0 "^^^ 
so that ||x||^ = (x, x) a . Compute 

jf t £ a {t) = -2aac a £ a + {(S* - S~ 1 )r 1 p) a + (p,g) a 

< (e a {l - e- 2a ) - 2aac a ) £ a + \\g\\ a (£ a + 1) 

< 2C e 1+ ^(£ a + l) for < t < de- 1 ^". 

In the second line we have used the fact that S* = e~ 2a S'~ 1 and H-S^ 1 !! = e a in the space £ 2 to bound the 
first term and we have used the Cauchy-Schwartz inequality together with the fact that x < x 2 + 1 to bound 
the second term. In the third line we have used the fact that ac a — 1 ~ s 2 and a ~ e to show that the first 
term is higher order in e than the second term and thus is no bigger than the second term for sufficiently 
small e. We have also used the smallness assumption on g to simplify this term. The result again follows 
from Gronwall's inequality. □ 

We can now prove Theorem 12.31 

Proof. The idea of the proof is to write u = u c+ + u c _ + v 2 + V\ where v 2 is localized and serves to cancel 
the leading order terms in ^||''i|| 2 . As a preliminary step we study the following system of equations: 

\ 2 = (/ - S- 1 ) [0x + 2e- 7 / 2 +"V c+ r c _] 

where r c+ and r c _ are solitary wave solutions of the FPU equation evaluated at k — c+t — Tq and k + c + t + To 
respectively. Our aim is to apply Lemma [2^41 to show that remains small on timescales of order e -1- ' 70 . To 
that end, we estimate 

^o-7/2|| (7 _ 5 -l )rc+r , c _|| Q = £ *7o-7/2|| rc+(/ _ S -ly c _ + ( S -l rc _)(J _ Ore+Ha 

< Ce'»- 1 / 2 ||r Cai || a + Ce , »- 3 / 2 ||(J-S , - 1 )r Ca || a < Ce 1+r '°. 

In the first inequality in the second line we have used that || (I — S~ 1 )r c _ a ||^» < Ce 3 and that |jr c _ a ||^oo < 
Ce 2 , which follow from (|14[) and the Mean Value Theorem. In the second inequality we have used that 
||r Ca .||a < Ce^l 2 and that S~ 1 )r Ca \\ a < Ce 5 ^ 2 which also follow from (|14[) and are again established in 

Lemma [3.21 . We remark that (at least for t < Tq), \\r c _ a \\ a is exponentially large, thus it is essential that 
we are able to use the £°° norm rather than the || • || a norm to estimate r c _ a . The bound for the £ 2 norm is 
similar. 

We consider now the system 

^ = (S - I)fh 

i> 2 = (I- 0(V»i + 2 £ - 1 +*((r c+ + r c _)0!) 

We again use Lemma [2.41 to bound both the £ 2 and weighted norms. Here it suffices to use the fact that 
1 0i 1 1 + ||0i ||+ + || 0i ||- < C and ||r c+ +r c _||^oo < Ce 2 to bound the inhomogeneous term. 
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Now let V2 ■= £ 7 / 2 i°<f> + e 9 / 2 2710 ip. We define the residual V\ = (R, P) by the ansatz 
r = r c+ + r c _ + £ 7 / 2 -''°0i + e 9/2_2?,0 V'i + £ 11/2 ~ 3r '°i? 
P = Pc + + Pc_ + e 7 / 2 - 71 "^ + e 9 ^- 2r >°tp2 + £ n /2-3»?op 

where <f> and ip are as above. 

Upon substituting the above ansatz into ^ we see that R and P satisfy the differential equation 

( R = (S-I)P 

I P = (I-S-^R + g 

where the inhomogeneous term is 
9 = (I- S- 1 [(r c+ + r c _ + R)] (i) 

+£ 3/2+r ;0 ( / _ + gl-DO^ + e 2 - 2 *^ 2 ] (m) 

+3e- 1 V2+3 W(rc++rc _j [(l-^ 1 )^^)] +S- 1 (r c+ r c _)(/-^ 1 )(r c+ +r c J) (in) 

+£ 2,70-2(J _ ^ 1 + £1-770^ +e 2 - 2 ™R)F 3 (r c+ ,r c _,£ 7 / 2 -™^ £ 9/2-2 V0 ^ u£ U/2-3 VaR ^ 

+ £ 3'7o-li/2(j _ S-i)f 4 ( rc+ , r c _ , e 7 / 2 -*^,^ 2 - 2 '^^ 11 / 2 - 3 *^ («). 

Here F3 is a homogeneous polynomial of degree 2 in its arguments and accounts for the third order terms 
in the Taylor expansion of V about other than those which arise from (r c+ + r c _) 3 . The function F4 
accounts for the fourth and higher order terms in the Taylor expansion of V about and is quartic in all of 
its arguments. 

From fact that ||r Ca ||.«°= < Ce 2 we see that (i) < Ce 1+,,a so long as \\ipi\\ remains 0(1) and |ji?|| remains 
0{e 7 ^- 1 ). Similarly (ii) < Ce 3 / 2 +* so long as \\(j)\\ remains 0(1), \\ip\\ remains 0(e m 1 ) 1 and ||i?|| remains 
0(e 2r >°- 2 ). To bound (Hi) we use the fact that ||r c J|^ < Ce 2 , that || (I - 5 _1 )(r c+ r c _ )|| f oc < Ce 9 / 2 , that 
||5'~ 1 (r c+ r c _)|| < HrcJHkcJI < Ce 1 ' 2 and that \\(I - S^rcJ < Ce 3 to conclude (Hi) < C£ 1+3r >°. 

Since F3 is quadratic, we have (iv) < Ce 2+2na so long as its arguments remain 0(e 2 ) and the coefficient 
remains 0(1). That is, we have (iv) < Ce 2+2T,a so long as <fi remains 0(1), tp remains 0(e I?o_1 ) and R 
remains 0(e 2 ~ 2ri0 ). We similarly conclude that (v) < e 2+3l '° so long as \\<f>i\\ remains O(e r,0 ~ 3 ^ 2 ), 
remains O(e no ~ 5 ^ 2 ), and ||J?|| remains 0(e 710 ^ 7 ^ 2 ). Note that we have only had to assume that ||i?|| remains 
Oie' 10 - 1 ) on timescales * ~ e^ 1 ^ . Thus from Lemma l2~4l it follows that \\R\\ 2 + ||P|| 2 actually remains 0(1) 
on timescales t ~ £~ 1 ~' )0 . 

To conclude the proof it remains only to show that if the initial perturbation is localized, it remains so 
for the timescales of interest. To that end, redefine the residuals R and P by the ansatz 

r = r c+ + r c _ + e 7/2 -'"°R p = p c+ + p c _ + £ 7l2 ^ a P 

and substitute into ([2]) to obtain 

( R=(S-I)P 

\ P=(I-S- 1 )R + g 

where g := e*- 7 / 2 (7 - S*- 1 ) (V(r c+ + r c _ + e 7 ' 2 -^') - V'(r c+ ) - V'(r c _)- e 7 ' 2 -^R). Use Taylor's Theo- 
rem to write V'(r c+ +r c _) = V'(r c _) + V"(r c _)r c+ +V'"(r c _)r 2 + +V""(9 1 )r 3 + and V'(r c+ +r c _ +e 7 / 2 -'''°i?) = 
V'{r c+ +r c _) + V"{r c+ + r c _ )e 7 / 2 ~ 1 i°R + V" (e 2 )e 7 ^ R 2 . Thus 

g = £ ^- 7 / 2 (I - S- 1 ) [(V"(r c _) - l)r c+ + V"'(r c _ )r 2 c+ + V""(e 1 )r 3 c+ 
+ (V"{r c+ + r c _) - l)e 7 ' 2 -^R + V" (9 2 )e 7 - 2r ' a R 2 ) 
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The quantities 6i are bounded uniformly in both space and time, so long as remains bounded. Since 

we are trying to prove that R remains bounded, and we must assume that R remains bounded in order to 
bound V"'(0i), we proceed with some care. More precisely, assume that there are some positive constants C4 
and C\ so that ||-R||f°° < C4 for < t < Ci£ -1- ' )0 . It follows that there is some positive constant C5 so that 
\V""(6 1 )\ + \V"'(e 2 )\ <C 5 . Thus 

11.911+ < ^0-7/2 ( || rc _ || (J _ ^-i )rc+ 1| + + c|| rc+ 1|^ || (Z _ ^-^r^ ||+ + C 5 ||r c+ HI^ ||r c+ 1| + ) 

+ C\\r c+ + r c _ \\ t ~ + + C 5 e 7 / 2 -"||i?||^ ||i?|| + 

< Ce 1+7 ' a + C 5 e 2 ' 7 ' + Ce 2 \\R,\\+ + C 5 e 7 / 2 ^°\\R\\ e ^\\R,\\ + . 

so long as remains bounded. We can always choose e small enough so that 

\\g\\+ <2Ce 1+m (12) 

where C is a constant which depends upon neither e nor R. Thus, so long as so long as the above is 
valid, i.e. \\R(t)\\ e ^ < d then from Lemma 4 we have \\R{t)\\\ + ||P(i)||+ < e 2CCl (||i?(0)|| 2 + ||P(0)|| 2 ) for 
< t < Cie -1- '' 1 . We may now, a posteriori choose C4 = 2e 2CCl and make e smaller if necessary so that 
(fT2|) holds. 

One may obtain similar estimates for ||.gj|- and \\g\\. In particular 

11.911+ + ||.9||- + ||5ll <Ce 1+ * 
so long as ||i?j|+ + \\R\\- + ||-R|| remains 0(1). This concludes the proof. □ 
We now show that Theorem 12. II follows from Theorems 2 and 3. 



Proof of Theorem \2.1[ The strategy for the proof is to consider three different regimes: 

1. The pre-interaction regime where the solitary waves are well separated and moving towards each other. 

2. The interaction regime where the solitary waves are not well separated. 

3. The post-interaction regime where the solitary waves have already collided and are now well separated 
and are moving away from each other. 

We choose initial data in the pre-interaction regime which is exponentially close (in £ 2 ) to the sum of two 
well separated solitary waves. In light of the symmetry u(n,t) 1— > u{—n, —t) for @ we may apply Theorem 
(|2.2p backwards in time. In particular, using §§§ with Si oc — S non i oc — e~ e 1 and r + = — = e~ 1 ~ 7 ' 1 yields 
{4|. To control the evolution in the interaction regime we use Theorem l2.3l However, note that the definition 
of the weighted norms change under the symmetry u(n, t) t— > u{—n, — t); when the solitary waves are moving 
away from each other the weighted norms don't see mass that lies in between the main waves, whereas when 
the solitary waves are moving toward each other they do. Thus to ensure that the localized error V20 is small, 
we must take S non i oc = e~ £ v ° above. Having done this, the localized error V20 is order £ 7 / 2 ->)o anc [ ^n e 
£ 2 error v\o is order e 11 / 2 -^ r io m interaction regime. In light of the symmetry u(n, t) 1— > u(n, t — T) we 
may again apply Theorem Using © with 5 loc = Ce 112 ^ and 5 nonloc = Ce 11/2 ~ 3r, ° yields ©. The 
weighted norm estimate © follows from p0[) and convergence of modulation parameters similarly follows 
from Theorem 12.21 

To complete the proof it remains only repeat this procedure for initial data which is exponentially close 
to the sum of solitary waves in the weighted space £ 2 fl l 2 _ a - Using (fTi"]) with S non i oc = 0, Si oc < Ce 1 l 2+r) and 
t+ = —t! = e -1- '' 1 yields (JT]). Upon applying Theorem 12.31 and using the fact that the initial perturbation 
is localized, we have vio = and V20 is order e 7 / 2- '' . Using (fTTj) with Si oc = Ce J / 2 ~ ,la and S non i oc = yields 

(St. □ 



The rest of the paper is devoted to proving Theorem [2721 Before sketching its proof, we briefly review the 
previous stability results [101 E] on which we build. The KdV equation is 

u t = u xxx + 3(u 2 ) x . (13) 
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Traveling waves u(x, t) = Q c (x — ct) satisfy the wave profile equation cQ + Q" + 3Q 2 which can be solved 
explicity to obtain Q c (£) = c sech 2 (-y/c£/2). These solitary waves Q c {- — t) form a 2-manifold in the space 
of initial data, parameterized by allowing the speed (or mass) c and the phase r to vary independently. To 
study perturbations of solitary wave solutions, one makes the ansatz u(x, t) = Q c (t)(x — r(t)) + w(x, i). The 
evolution of the perturbation w is governed by a dispersive PDE. In order to control its evolution, Pcgo 
and Weinstcin [6J work in an exponentially weighted space and prove that w(t) tends to zero at t — > oo in 
this norm. The evolution of the speed c(t) and phase r(t) are governed by ODE's which are coupled to the 
PDE for w(t). The control of w in the weighted norm, together with energy estimates which come from the 
conservation of the Hamiltonian and an orthogonality condition on w, is sufficient to imply convergence of 
f(t) and c(t) as t — > oo. 

In 1999 Friesecke and Pego showed that in the "KdV" regime of long- wavelength, small amplitude traveling 
waves whose speed is close to the "sonic" speed, which is c son i C = ±1 in our case, FPU solitary waves could 
be well-approximated by KdV solitary waves. More precisely, they showed that if the speed c is sufficiently 
close to 1 (or —1), and if one writes c 2 = 1 + -^e 2 , then the traveling wave profile satisfies the estimate 

\\^r c (-)~M-)\\m m <Ce 2 (14) 

where </>i(£) = |sech 2 (^/2). This closeness between KdV and FPU solitary waves is instrumental in importing 
estimates for KdV linearized about a solitary wave to the FPU setting. 

In a series of papers from 2002 to 2004 [8l [9j [10] , Pego and Friesecke showed that these waves are stable 
with respect to perturbations which are small in both I 2 and a weighted space. Their method of proof is 
adapted from [6J and indeed relies heavily on results presented there. This series of papers provides a blueprint 
for importing estimates which are known for the KdV equation to equations which have KdV as a long-wave 
low-amplitude scaling limit. However this work, as well as that of [B], is limited to perturbations which lie in 
a weighted space and thus says nothing about the behavior of solutions which differ from a solitary wave by 
a perturbation which decays very slowly. 

Orbital stability in the energy space for gKdV solitons was obtained by Martel and Merle in [12]. The 
conceptually simplest version of the argument relies on a virial identity for the perturbation equation. That 
is, small perturbations to the solitary wave never linger near the solitary wave - regardless of whether they 
are exponentially localized or not. Since evolution of the modulation parameters c and r is controlled by the 
mass of the perturbation near the solitary wave, one may use this estimate to control the evolution of c and 
r, and thus use energy estimates to obtain Lyapunov stability for the solitary wave. 

More recently, Mizumachi [11] has obtained orbital stability in I 2 for solitary waves of {2J in the KdV 
regime via an adaptation of [5] which uses ideas present in |12| . Unfortunately we know of no virial identity for 
the evolution of perturbations to solitary waves in FPU. However, it is not hard to show that exact solutions 
of FPU with small initial data move more slowly than larger solitary waves. Mizumachi decomposes the 
perturbation into two pieces, one of which is an exact solution and the other of which is exponentially 
localized [IT]. With this decomposition one can show that FPU solitary waves are stable to all sufficiently 
small perturbations in £ 2 even if they are not exponentially localized. 

With this background in mind, we are now ready to outline our proof. We first make the ansatz 

«(*) =«c+(t)(- -r+(*))+« c _(t)(- -r_(t))+«(t). (15) 

Here we allow the modulation parameters c a and r a for a £ { + , — } to vary in order to control the neutral 
modes associated with variation in wavespeed and phase. This control is achieved via orthogonality conditions 
which we place on the perturbation v. 

It is a consequence of the convexity (and conservation) of the Hamiltonian that so long as these orthogo- 
nality conditions are satisfied, we obtain 

\\v{t)f < C (||v(to)|| 2 + e\c+(t) - c+(to)\ + e\c-{t) - c_(t )|) + exp. (16) 

Here exp denotes the exponentially small terms generated by the interaction between the tails of u c+ and 
u c _ . Thus to obtain orbital stability it suffices to restrict attention to the region where the interaction terms 
are small and to control the evolution of the modulation parameters c a . 

Substituting the ansatz (|15|) into ([2]) we obtain a lattice differential equation for the perturbation v which 
is coupled to a system of ordinary differential equations for the modulation parameters c a and r a . If one 
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localizes the perturbation to the left- and right- half lattices by writing v = v + + V- then the linear part of 
the lattice differential equation for v a generates a semigroup which was studied in [10] and is known to decay 
exponentially for initial data which are exponentially localized and orthogonal to the neutral modes. Given 
arbitrary initial data v G I 2 , we follow in decomposing it as v = v\ + «2 where v\ is an exact solution of 
|(5J) and t>2 is exponentially localized. We may now further localize i>2 by writing V2 = t>2,+ + v 2,- with vi. a 
near u Ca . Now V2, a lives in the space on which the semigroup generated by the linear part of its evolution 
equation decays. Thus we may write a variation of constants formula for V2. a - However, this equation is now 
coupled to the exact solution part of the perturbation V\ as well as the modulation parameters c a and r Q 
and cross terms associated to u c _ a . To give a sense for the relationship between the quantities involved, we 
write the variation of constants formula here: 

K«(*)Ha < Ce-^-^CKafoJHa + Ka(to)||wJ + C\\ Vl , a {t)\\w a + C f e~ b ^ \\G a (s)\\ a d S 

where 

\\G a (s)\\ < e 2 ||«i ia (s)|| V ir +e- 1 /2|c a (a)|e B / 2 (f a -Ca) 

+ (l^a( s ) - Ca(s)\ + \c a (s) - C a (t )\ + \\v(s)\\ + £ 4 ) 1 1 V 2 , a (s) 1 1 a + exp 

Here the two-sided slowly decaying weighted norm |j • \\w a is given by 

IMlkM-E^ 25 '^'^ ( 17 ) 

and a ~ e 2 . The reason for this choice is made clear in the proof of Lemma 1331 

As is often done in stability analysis for PDEs, we first restrict attention to solutions for which G a above 
is small. So long as this holds, after some computation and applications of Gronwall's inequality and Young's 
inequality, we obtain 

/ 1 KoWlla* < Ce~ 2 f 1 \\v ha (t)\\ 2 w Jt + Ce- 3 (\\ Vl , a (to)\\w a + Ka(*o)||a) + exp 

J to J to 

Since we have followed Mizumachi and chosen V\ to be an exact solution of ||5J) we may use the virial identity 
f51]) to see that 

/ IKaO)llk <Ce-*||t;i(to)|| 2 + exp. 

J to 

Through a separate analysis, which also mimics [6l[8l[TT], we obtain 

Ca < C(e 2 + a|| + ||w2,«||)(||wi,q||^/ q + ||«2,a||a) + exp. 

and upon integrating this equation and using the above, we see that |c Q (i) — c a (to)| remains small as long as 
G a does. We may now conclude a posteriori that G a and hence |c a (i) — c a (t )\ remains small for all time. 
In particular, wc may use (|16p to conclude that |w(t)| remains small for all time. 

Note that large negative powers of e appear in several of the estimates above. These large coefficients 
arise because in the small amplitude long wavelength regime, solitary waves move only slightly faster than 
smaller amplitude radiation, which disperses at the sonic speed c = 1. Thus, the weighted norms associated 
with these solitary waves decay very slowly, hence have very large integrals with respect to time. Given that 
the large coefficients are present, controlling the above quantities requires some care. 

3 Coordinates for the FPU Flow Near the Sum of Two Solitary 
Waves 

As discussed in [71 [S] there is a 2 dimensional manifold of wave states parameterized by wavespeed c and 
phase t. Define the tangent vectors £i(t,c) := d x u c {- — r) and ^(^c) = d c u c (- — r) so that £i(t, c) and 
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^2(t, c) span the tangent space of the manifold of wave states at the point u c {- — t) 
(r a , c Q ) we suppress the argument and refer to it as £j iQ . 
In the spirit of j8] we define the bilinear forms 



When we evaluate £j at 



raGZ L 



X2,k + U2,n 



n-1 

E 



fc=-c 



and 



^ X 2 , fe + J/2, n Xi, 



w_(x,y) = -^ 2/1,. 

nEZ L A;— n+l k—n 

Observe that uj+{x,y) — u>-{x,y) = X) n ez Sfeez(y i .™ X2 > fe + J/2,nXi,fc) so that whenever either a; or y is zero 
mean, the forms u>+(x,y) and u-(x,y) agree. Also, when either x or y is zero mean, the forms have the 
anti-symmetry property 

uj a (x,y) = -u> a (y, x) . (18) 

For this reason we refer to io a as a symplcctic form even though (fT8|) does not hold in general. A particular 
case when x is zero mean is when x is in the range of J, thus u> a (Jz,y) — (z,y). The reason that we take 
care to define two separate forms is because we will have occasion to apply them to sequences which needn't 
be zero mean such as £2,0- The key difference between w + and w_ is appears in how they act on the weighted 
spaces £ 2 and £ 2 _ a . 

When a > the form lo + is continuous when regarded as a map from £ 2 _ a x £ 2 to R but not when regarded 
as map from £ 2 x £ 2 _ a — > R. Similarly, w_ is continuous when regarded as a map from £ 2 a x i 2 _ a to R, but 
not when regarded as a map from l 2 _ a x £ 2 to R. Both w + and w_ arc only defined on a dense subset of 
£ 2 x £ 2 — > R and neither arc continuous with respect to the ^ 2 x £ 2 topology. The following useful inequality 
is an immediate consequence of the Cauchy-Schwartz inequality for the £ 2 , £ 2 _ a dual pairing. 

1 



1 



\->'\\(- 



wui. 



\x\\p_ \\yhi t 



(19) 



In the tradition of Pego and Weinstein [6], Pego and Friesecke [8] , and Mizumachi [TT] we impose 
orthogonality conditions on the perturbation of our ansatz. We seek to study solutions u of ^ which satisfy 



u(k, t) 



where vi(k,t) also satisfies ©. 
Denote v\ a = Vih a and v = v\ - 



1 



c+(t)(k - T+(t)) + w c _( t )(fc - r_(t)) + ui(fc,t) + U2(fc,t) 

Let h + (k) = X[o.oc){k) be the Hcavisidc function and h_(k) 
- V2- The orthogonality conditions that we impose on V2, a are 

Wa(f2,Q,f2,tt) = a G { + ,-}■ 

Remark. _ffere condition (|21[) is imposed on the full perturbation v while condition (|22p is imposed only on 
t>2- Pego and Friesecke do not decompose the perturbation and require u>(£i, v) = 0. Mizumachi introduced the 
analogue of (|22[) ^/or a single wave) which allowed him to control the modulation equation for c a well enough 
to obtain orbital stability. 

Substituting {20]) into © we obtain an evolution equation for V2,a- 



(20) 
h+(k). 

(21) 
(22) 



where 



L a V2, a = (Jg a )h a + l a 
L a := d t - JH"(u Ca ), 



(23) 
(24) 



h a , and 



(25) 



^2 cp&'p ~ fa ~ c ^ the 

g a := H'(u c+ + u c _ + v x + v 2 ) - H'(vi) - H'{u c+ ) - H'(u c _) - H"(u Ca )v 2 , a . (26) 

Our aim going forward is to analyze the evolution of the quantites 1)2, fi, c and r rather than our solution 
u. Before we may do this in earnest, we must show that the two are equivalent; i.e. that (r, c, Vi, 1)2) form a 
coordinate system for u. Moreover, we must show that these coordinates are valid even if v\ and V2 are as 
large as the radiation due to collision could be, i.e. as large as the error terms in Theorem [ 
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3.1 Estimates in the KdV regime 

In Theorem 12.21 we are careful to demonstrate how the size of both an admissible initial perturbation S and 
the evolution of that perturbation depend on e. This is necessary so that we may establish that error incurred 
from the finite-time approximation yields an initial condition which is not too far from the sum of solitary 
waves and moreover remains small compared to the main waves. This requires an understanding of how each 
constant which appears in [101 111] depends upon e. In this section, we establish small e asymptotics for the 
higher derivatives of the wave profile with respect to c and r. Some of the details of the computations appear 
in the Appendix A.l. 

The small e asymptotics for the first derivative of the wave profile are established in [7j. In this section, we 
refine those estimates. Upon substituting a wave profile r(t, k) = r c (k — ct) into the second order differential 
equation for r, r\ = (S + S~ X — 2I)V'(rfc), one obtains the differential difference equation r"(£) = (T + T^ 1 — 
2I)V'(r c ) where the translation operator T is given by (Tx)(£) = x(£ + 1). Taking the Fourier transform 
one obtains the fixed point equation r c = P c N(r c ). Here N(r) := V'(r) — r = \r 2 {\ + r/(r)) with the 
above equation serving as the definition of both N and 77. The operator P is a pseudodifferential operator 

with symbol p c (0 — ^-"inc^f^) • ^ ne can now ma kc the scaling c = 1 + f^/S and the rcnormalization 
r c {-) 1— > (p^-^ := e 2 r c (-) to obtain a sequence of fixed point equations cj)^^ = P e N e ((f) < ' £ '^). Here p( e 'P> has 
the symbol p&(£) = ^ sinc - 2(e ^ 2) — and N e {cf>) := £ - 4 N{e 2 <p) = \4> 2 {l + r]{e 2 (j))). Note that whenever p 

1-1- ^2 /3 sine 

appears with a superscript, it denotes the symbol of a pseudodifferential operator whereas p with a subscript 
is used to denote the momentum component of a solution to @. 

One may now take the limit as e — > and recover the fixed point equation for the KdV wave profile 
<j>p = p(°)JV(°)(0 J g) where P(°) has symbol p (0) (£) = and N^(<p) = \(j) 2 . The crux of the argument 

in [7] is that p e — > p° uniformly on R. It is then an elementary result in the theory of pseudodifferential 
operators that p( £ ^) — > P in the operator norm. This (together with the invertibility of (/ — P o 7V°(0 ) g)) on 
a weighted space orthogonal to the neutral modes which is established from the theory of the KdV equation) 
is precisely what is needed to show that the fixed point equation cj)( E -^ = p( s >0> N e (cf>( s 'P') can be solved via 
a contraction mapping argument. 

In this paper, the collision problem requires precise small e asymptotics for (f>( E -^ and its derivatives in 
the weighted space H\. We accomplish this by regarding the fixed point equation fy 1 ^-^ = p( e <^) N E (^ e '^) 
in the space of even functions whose derivatives up to order 3 are square integrable after multiplication 
by the weight e ax . To solve this equation via contraction mapping arguments we must now obtain small e 
asymptotics for P^-' 3 ) — P° as well as (/ — P°N ((f>p))~ 1 , regarded as operators in the weighted space. To 
that end, we present the following lemma which is proven in Appendix A.l. 

Lemma 3.1. There are constants C > and a > such that for e sufficiently small and any s > the 
following hold 

1. \\P^-P^U( Hi )<Ce 2 , 

2. H^P^ - ^P (0) |U (ff|) < Ce 2 

3. \\d k N%x)-d k N°(y)\\ ciH s ) < \\x-y\\ H . + Ke 2 fork = 0,1,2, 
I || [i-P^dN^i^y'Wc^KC, 

Here, and always, when A is a Banach space, C(X) denotes the Banach space of bounded linear operators 
from X to itself and || • \\c(X) is the operator norm. With the aid of Lemma [3~T1 we may establish the leading 
order behavior for small e of the higher derivatives of the wave profile with respect to c and r. 

Lemma 3.2. There is a constant C which does not depend on e such that for e sufficiently small 

\\d k p ^-d k p M\Hl^<Ce 2 (27) 

and 

\\d k c dir c \\ L 2 a + \\d k di Pc \\ Li < <7 £ 3 /2+i-2fc (28) 

hold with k + j = 0, 1, 2, 3. 

2 If c 2 = ^yf-, then p' = p + 12e 2 (^-) 2 , thus in the small e regime the scalings c 2 = 1 + and ±c= 1 + ^ are equivalent to 
leading order in e. 
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Remark. The weight a/e which appears in the equation for the renormalized wave profiles (|27[) aught to be 
regarded as the renormalized version of the weight a. Recall that a ~ e so a/e ~ 1. 

Proof. We first show that Q28p follows from (j2"T)h Suppose that (J27J) holds. Then since d%4>p doesn't depend 

on e it follows that d^d^cj)^^' — 0(1) in the small e regime for j and k between and 3, in particular 
for < j + k < 3. To obtain the bound on r c and its derivatives in (|28p one merely uses the definition 
r c (x, t) = e 2 <f>( e '^ (e(x — r)), the observation that -p = (D(e~ 2 ) for the scaling c 2 = 1 + fo/3, the observation 
that d T r c (x,r) = e 3 d x (f> ( - £ > /} ^ (e(x — r)), and the fact that |j/(^)||z,2 = e 1 ^ 2 ||/(-)IU 2 a - To obtain the bound on 
p c , write 

r c (-) = 9c(- + 1) - 9c(-) = ^(^£5) = — Pc(^^) (29) 
e e e e c e 

for some ry € (0, 1) which may depend upon x. Here the second equality uses the mean value theorem and 
the last equality uses the fact that wave profiles satisfy the FPU equation to relate spatial and temporal 
derivatives. Now use the fact that for any r\ <G (0, 1) we have p c (£_ + rj) < p c (£ + 1) + p c (Q to conclude that 
||Pc||l 2 < 2c||r c ||i2. Similarly, we can take derivatives with respect to c or r in |29|) to establish the bound 
on p c from the bound on r c in (f2"5)l . 

The rest of the proof consists of establishing that (|27|) follows from Lemma 13.11 We first solve the fixed 
point equation = N £ (4>^) in the weighted space of even functions E 3 . A standard uniform 

contraction principle approach allows one to continue the fixed point at e = to nonzero values of e so long as 
the equation is sufficiently smooth in e, and the linear part of the equation at the base point 4>p is invertiblc. 
Moreover, it is a consequence of the uniform contraction principle that the fixed point (jf'P is as smooth in 
/? as the operator P e ^ and further that the fixed point <j) e 'P is close to the fixed point tfip when e is close to 
0. More quantitatively, \\4> e '^ — <j>p\\ < \\p( £ '^N e — P°A°||. The details of this argument are written out for 
the C 1 case in Lemma A.l in [7j. Here we apply the C 3 uniform contraction principle, rather than the C 1 
uniform contraction principle, but otherwise the arguments are the same. 

Observe that the operator I — P N ((ftp) is a compact perturbation of a pscudodiffcrcntial operator, thus 
its essential spectrum in H s is independent of s. Moreover, from [5] it's essential spectrum in E® is bounded 
away from the imaginary axis and it has no even eigenfunctions in L 2 , thus no eigenfunctions in E^ for any 
s > 0. Thus - P°N O (4> ))- 1 \\ C(E 3 ) < C. From LemmaOwe have ||p( e ^) - P a \\ C (El) < Ce 2 . Thus we 
have the requisite regularity in e and invertibility of the linear part to apply the argument. We thus obtain 
for e sufficiently small: 

1. that G E%, 

2. that \\4>^-^ - 4> p \\ < Ce 2 , 

3. that ||(I— p( E 'P*> N E (x))~ 1 \\c(E 3 ) is bounded uniformly for ||x — <f>p\\ small, and 

4. that (3 i— > (j) 5 ' 13 is thrice continuously differentiable. 

One may now implicitly differentiate the fixed point equations </>( £ ' /3 ) = P^'/ 3 ) jV £ (0( e >^)) and <fip = 
P°N°((f>/s), isolate the difference 9^' e,/3 - ) — d^(j)p, and bound the terms on the right hand side. In the 
case k = 1 we obtain: 

_ dfo = (I- P^dN e {<f>^))- 1 ((P°dN°(<f) ) - P^dN s {^ £ '^))d 0( j) 
+ dpP( e <flN e (<f>t c >fi) - ^P°iV°(^)) . 

Thus in light of (2) above and Lemma |3~T1 we see (after adding and subtracting relevant terms and using the 
triangle inequality) that 

Wdp^ - dpfyWHi <Ce 2 (30) 

as desired. 

Implicitly differentiating again we obtain 

d|</>^ - d}<t>p = (i- p^dN^^))- 1 ^) + (a) + (iti) + (iv)) 

where (i) = (P°dN°((l) )-P( s '^dN £ ((j)^))d 2 (f> , (ii) = d 2 p p( e ^N £ (<f>( e ^ydjP N a (<t>p), (Hi) = 2duP i - e ^dN e ((t)^^)dp^ e 

2a /3 p o aA o (0 (9 )a ( 90 )3 , and (iv) = p^^d 2 N e (^^)(dfs^ E ^) 2 -p°a 2 A o (0 (9 )(a / 30 ( g) 2 . 
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We may add and subtract relevant terms, use the triangle inequality, and bound each term with either 
(2) above, equation ([30|) . or Lemma \3. II to see 

\\d^-d^\\ H ,<Ce 2 (31) 

as desired. 

Implicitly differentiating once more, we obtain 

- dl<j)p = (I- P^dN'i^))- 1 ^) + (ii) + (Hi) + (iv) + (v) + (vi) + (vii)) 

with (») = [p°aiv (^)-p( £ '' 3 )aiv E (^ e '' 3 ))]5|^, (i<) = dlP^N £ (<j>^)-dlP a N°((j) ), (m) = ^p^dN B {<t>^)d P 4>^ 

3d}P°dN\<i> )d <}> , {iv) = MiiPW&N*(tl>W)(de<l>W ( v ) = 

3dpP( s <ftdN e (<l>( £ <ft)d 2 p<t> - 3dpP°dN (^)dlcj)p, 

(vi) = P^)d 3 N{(j)^)(d 4>^\ d <f>( £ <P\ dp^ e '^)-P°^N(4> )((dp^, fy^,, fy^j), and (wit) = 3P^3 2 A^(0( £ <«)(d, 3 < ? (>( e '' 9 > 

Once again, we may add and subtract relevant terms, use the triangle inequality, and bound each term 
with either (2) above, equation (|3"D|) . equation (|3"Tj) or Lemma [57X1 to see 

||a|^) - tffaWB. < Ce 2 

as desired. This completes the proof. 

□ 

We now use these small e asymptotics to establish the existence of coordinates in a tubular neighborhood 
of the sum of two well separated solitary waves. 

3.2 Tubular Coordinates 

For the remainder of the paper we will regard (r, c, V\,V2) as coordinates for the solution u. Here the pair 
(r, c) is shorthand for the four-tuple (t+,t-,c + ,c-). The strategy for proving Theorem 12.21 is as follows: 

1. Let u be the solution of ([2|) of interest. Pick an initial time to, initial phases r*, initial speeds c* , and 
initial localized perturbations v^ a and let V\ solve ([2]) with Vx(to) = w(io) — S a ( u c* (•-?-*) + v 2,a) 

2. Given t > to determine the unique r a = r a (t) and c a = c a (t) such that 

W a (£l,a) («(*) - X! Uc °(' _ T a))ha) = («(*) - - X! " c ° (' ~ T "))^") = °' 

a a 

Define u 2 (t) := - vi(t) - Y. a u c a (t){- ~ r a (t)). 

3. Derive equations for the evolution of T a , c a , and t>2 

4. Use the equations derived in (3) to control the evolution of r Q , c a , and Vi- 

5. Given the coordinates (r, c, t^) constructed in (2) and controlled in (4), conclude that u(t) = J2 a u c a {t) (~ 
r a(i)) + Vi(t) + vi(t) is well-behaved. 

In our application of Theorem 12.21 to the proof of Theorem 12.11 the choice of to, t*, c* , and Q m s * e P 
(1) are naturally given by Thcorcm l2.3l Step 3 is the subject of section 3.3 and step 4 is the subject of section 
4. Step 5 follows trivially from step 4. This section is concerned with step 2. Similar work is done in section 
2 of [5] and Lemma 4 of [TTj, though for a single wave. A key difference is that we must once again control 
the size with respect to e of the neighborhood on which this decomposition is valid. 

First, we state and prove the following lemma, which provides some preliminary estimates on the restriction 
of the symplectic forms to the tangent space of the wave state manifold. It corrcponds to Lemma 2.1 in [8]. 
In what follows the matrix A is given by 

A(t,c)=A q (t,c) + A 1 (t,c) (32) 

where 
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A 0+ (t + ,C + ) \ / U a (£l,a,£,l, a h a ) OJ a (il l!a , & t0l h a 

A {t,c)= ( with A 0a (T a ,c a ) = 

A -(t_,C_) / V UJ a (£ 2 , a ,il, a h a ) LU a (£,2.a,^2, a h c 




A 1 (t,c)=\ with A la {r a ,c a ) 



—a ^a) 

— afoot —a , 



Here as always £j )C( is evaluated at (r a , c a ). 



Remark, in f/ie following lemma, and in the remainder of the paper, we use the notation exp to denote 
terms x which satisfy \x\ < C(e~ em ' T + + e~ £m ' r -') for some constants C and m which may be chosen 
uniformly in e. When the value of r a is allowed to vary we write exp(s) to denote terms x which satisfy 
\x\ < C{e~ em][T+tyS ^ + e -e " l ' T ~( s - ) '). Notice that for any p we have e p exp = exp by making m smaller and C 
larger if necessary. This convention will simplify many expressions throughout the paper. 

Lemma 3.3. There is a positive constant Tq such that if ' r+, — t_ > To then the matrix A = A(t, c, w) given 
by p2[) is invertible. Furthermore, the off-diagonal terms A\ satisfy \\Ai\\ = exp. Moreover, in the KdV 
scaling (fT4|) to leading order in e we have Tq ~ ■ lo f - and 



A- 1 



/ £~ 4 exp exp \ 

exp exp exp 

exp exp e~ 4 

y exp exp exp J 



(33) 



Remark. When A and B are matrices and we write A ~ B, this means that there are positive constants Cij 
and Cij which don't depend on e such that Cijbij < < Cijbij holds. 

Proof. We show that Aq is the sum of a constant invertible matrix, plus a term which is exponentially small 
in the phases t + and r_ which measure the separation between the solitary waves u Ca and the cutoff of the 
localization h a . We also show that the off-diagonal terms Ai go to zero exponentially in the phases r + and 
r_ . This completes the proof. The details follow. 

The matrices Ao )Q , were studied in Lemma 2.1 of [8] for the case a = + and h + = 1. Our result is 
similar though due to the presence of the non-constant function h a the matrices contain exponentially small, 
time dependent components. Furthermore, the collision problem demands that we keep careful track of the 
e-dcpcndcncc of all quantities, unlike [§]. Consider the matrix 



A 0t+ (r+,c + ) 



1 (l 

\FP ; 



^-A 0)+ (r +1 c + ). 



Here A FP is exactly the matrix studied in Lemma 2.1 of j8j- From that lemma, and from Lemma 9.1 of 
[7] it follows both that A FP is constant and invertible, and that A FP ~ [ ^ -2 )i thus (^q 7 ^ 3 ) -1 ~ 



e" 4 e- 1 
e- 1 



For the matrix Aq,+, note that due to the inequality (| 1 9j) we see that w (^„, £j a h— a ) < 



^ll?i,a||f 2 H^.q^-qIU 2 ■ In light of the fact that \^j, a (k)\ < a ' fc T "' we see that 



fc— — oo 



exp 
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In the last inequality we have used the fact that t + > 0. The factor of a in the denominator comes from 
summing the exponential. Thus we obtain that Ao.+ is the sum of a constant, invertible matrix plus a 
perturbation that is exponentially small in |r|. The inequality ||^j,-^+||^ 2 < "f" e "' r ~' a l so holds, thus we 
obtain an analogous result for A _. In fact, these bounds suffice to yield ||^4.i|| < — (e~ aT + + e aT -). In 

/ j{FP q \ 

particular, denoting A FP = ( ° ^ FP \ we obtain 

\\A-A FP \\ < -e- aT " 
a 

for \r a \ > To. Since A FP is invertible, it follows that A is also invertible as long as — e _aT °||( J 4 FP ) _1 || < 1. 
Isolating To we see that A is invertible for 

T o > ~ ( r.nuAFP^u ) ~ ^ lo g(^) ~ ^ l0 S( £ ) ( 34 ) 



a 



K\u Fp r i \ 



Since A 1 differs from {A FP ) 1 only by terms of the form exp, (|3"3")) holds and the proof is complete. 

□ 

The following proposition quantifies the size of the neighborhood of two well separated solitary waves on 
which the symplectic orthogonality conditions (|21[) and (|22p uniquely specify the modulation parameters c 
and r in the ansatz (|20p . Its proof is given in Appendix A. 3. 

Proposition 3.4. Let u and V\ in £ 2 be given. Define 

w(r+,r_,c + ,c_) := u c+ (- -r+) +u c _(- -r_). 
For each r\ > there is a C > which may be chosen uniformly in e such that whenever 

IhH + !!(«-&«, r*,c; jC *) <Ce B/2+fl *e[t 0j ii] (35) 

/or some r* < —To < To < tT and c*_ ~ —I — e 2 < 1 + e 2 ~ c5_, £/ien £/iere is a unique choice of phase r a 
and speed c a such that 

w a (Ci, ai (u - u(T a ,c a )h a ) = w Q (£ 2 ,a, (u - u(t + ,t_,c + ,c_) - = 

holds. In particular, upon denoting i>2{t) — u — v\ — u(t_|_, r_, c_|_, c_) £/ie orthogonality conditions (|21[) and 
(|22p are satisfied. 



3.3 The Modulation Equations 

Having established that studying the flow of FPU near the sum of well-separated solitary waves is equivalent 
to studying the coordinates (r, c, v 2 ) we return to equation (|23|) which describes the evolution of V2, a - To 
completely describe the evolution of the coordinates (r, c, t> 2 ), we must derive evolution equations for the 
modulation parameters r and c. In order to work with modulation parameters whose variation is small, we 
define j± by r±(t) = J t c±(s)ds + 7±(i) and we study the evolution of 7 and c. Modulation equations are 
obtained by applying u> a (£i, a , •) to (1231) and moving all terms which do not depend upon c or 7 to the right 
hand side. The following lemma makes this precise. 

Lemma 3.5. Let t Q < ti be real numbers and suppose that the ansatz (|20[) and orthogonality conditions (|2ip 
and (|22p are valid for t £ [t ,ti]. Then 

U a (&,a, L a V2, a ) = N 2 , a + £2,0, (36) 

and 

Wa(€l,a, L a V2, a ) = Ni >a + 4,a (37) 

hold for t G (t Q ,ti). Here £j iQ! is evaluated at (r a (t) , c a (t)) , L a V2, a is evaluated at t, and the quantities £ i a 
and iVj )Q , are given by 

il,a = -jaUJaidr^i^^Va) - C a LU a (d c £l >a , V a ) (38) 
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h, a = -7aW a (9 T 6,a^2,a) ~ C a UJ a (d c &,a, V 2 , a ) 



and 



Ni, a = — W a (£i )0 , L a Vl, a ), N 2 , a = —Vu(€l,a,V2, a ). 

Proof. The proof proceeds by implicitly differentiating 

^a(^l,a(r a (t),C a (t)),V a (t)) = U a (£2,a(T a (t), C a (t)),V 2 , a (t)) = 

to obtain 



(39) 
(40) 



and 



T a uj a (d T ^ a ,v a ) + c a u a (d c £i >a ,v a ) +uj a (t;i, a ,dtv a ) = 

,oli ^2, a) Ca^a 



(41) 
(42) 

Use the fact that d T £, 2 ,a = -^-JH"(u Ca )^ 2ta — -j=-£i jCe , that oj a (Jx,y) — ~oj a (x, Jy), and that T a = c a +j a to 
rewrite (|42|) as 

Wq(C2 : q, L a V 2ia ) - U a ({;i, a ,V2,a) + A /^a{d r ^2,a, V 2 ,a) + CaW a (9c6,c>, «2,a) = 

which is exactly (|36|) . Similarly use the fact that d T £i, a = — JH" (u Ca )£i ta , that uj a (Jx,y) = —ui a (x, Jy), 
and that t a = c a + j a to obtain 



which is exactly (f3"T|) . 



□ 



With the aim of using Lemma 13.51 to obtain equations for c a and j a , define £i >a := u} a (£i,a,£a) and 
Ni ta := w Q ((,. Q , (Jg a )h a ) where ^ Q and g a arc defined in Q25[) and (|26[) respectively so that upon applying 
w a (£i,ai ') to equation ([23]) and using Lemma T3. 51 we obtain a system of four ordinary differential equations 
for c a and j a which we may write in components as 



and as a system 



A 0+ - B + A 1+ 
Ai- A - - B- 





i = 


1....4, 




( i+ ) 




/ iVi,+ - 




c+ 






N 2 ,+ 


7- 




JVi,- - 


Ni,- 


V J 




^ N 3 ,- - 


N 2 ,- j 



(43) 



Here Ai^ a is defined in (|32p and studied in Lemma 13.31 The nonconstant parts of A, jQ correspond to the 
inhomogeneous term which appears on the right hand side of the perturbation equation (|23p and arise 
because the modulation parameters c a and r a are allowed to vary. 
The matrix B a , which is given by 



B a 



corresponds to the inhomogeneous terms which measure the failure of the range of the differential operator 
L a to be symplectically orthogonal to the tangent vectors and £2,0- So long as the smallness condition 
(|35|) holds, it follows that 

A d ( £ 5/2 (lkl,a||w a + ||w2,q|| Q ) £ 

Here we have used (|28|) . (|3"5]) . the fact that || • || bounds || • || w a , and the fact that u Ca is exponentially localized. 
Moreover, as before the off-diagonal terms are not only bounded above by a multiple of e but also bounded 
below by a multiple of e. Thus so long as (|33|) holds, Ao ta — B a is invertible. Let A denote the matrix on 
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the right hand side of (|43|) . In light of the fact that the off-diagonal terms of A are exponentially small, we 
see that A is invertible and to leading order in e is given by 



A~ 



exp exp 



e 1 £ 1/2 (|K,+ ||w + + ||v2,+ || + ) exp exp 
exp exp £~ 4 e -1 

V exp exp e- 1 ^(IK-lk- + K-H-) / 



Thus we have proven 

Lemma 3.6. Assume that (|35|) holds. Then there is a constant K which does not depend on e such that 
\c a \ < K (e^N^ - N ha \ + e^dl^H^, + ||«a, a || a )|JV a>Q - N 2 , a \) + \N it _ a - N it _ a \ j exp (44) 



and 



|7a| <^(e _4 |iVi,a-JVi, Q |+e- 1 |JV 2 , Q -JV 2 , Q |) + j ^ \N it - a - N it - a \ J exp 



(45) 



We now refine Lemma l3.6l bv obtaining quantitative estimates for the quantities which appear on the right 
hand side in (HI) and (l45l). 



Lemma 3.7 (Bounds on the Right Hand Side). Let g a be given by (j!?6"|) . Assume that ([55]) holds. Then the 
following hold 

\\(Jg a )h a \\ a < Ke 2 \\v 1 , a \\ Wa + K(\\ ) + exp (46) 



and 



\N lta - N 1<a \ < Ke 3 (\\v ha \\ 2 Wa + \\v 2 , a \\l) + exp 

-l/2( 

In particular, combining equation (|44j) and ()45j) wit/i equations ()46|) . (|47j) . anc? (|48j) we obtain 

\c a \ < K(e 2 + \\v ha \\ + \\v 2 , a \\)(\\vi, a \\w a + \\v2, a \\ 2 a ) + exp 



|iV 2 , a - iVa,^ < A' ( £ 3/2 1 1 Wl , a 1 1 W a + S~ L/ * ( | \v 1<a \ \ + \ \ V 2 , a \ | ) 1 1 V 2 , a \ | a) ) exp 



and 



-3/ 2( || 

^l,a|| H~ || ^2, a || ) || ^2, a || a 



Proof. We first estimate g a h a . 

Let G(u, v) := H'(u + v) - H'(u) - H"(u)v so that 

g a = G(u Ca ,v a ) - G(0,v a ) 

+G(v la ,v 2a ) + (H"(v la ) - l)v 2a 
+(H"(u c J-l)v la 



d) 

(II) 

(HI) 



+ - H'(u c _ a ) + H'(vi, a ) - H'(vi) + H'(u c+ + u c _ + v) - H'{u Ca + v a ) {IV) 



(47) 
(48) 

(49) 
(50) 



(51) 



Since H is smooth, term {I) is order u^v 2 . Since H"{0) = Id, term (77) is order |«2,a|(|^i,o| + l v 2,a|) and 
term (III) is order vi i0 u Ca . Term (IV) becomes exponentially small in the weighted norm upon multiplying 
by h a . 
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We compute the terms (I) - (IV) in turn. 

imil < K ^2kez e 2a ^~ Ta) u Coi (k - T a ) 2 v a (k) 4 

< A^IKIIS- Ekez^ 2{K ~ a)lk ~ Tal v a (k) 2 

< K^\\v a \\^(\\v lia \\^ a + \\V2,41) 

< K^ kez e- 2 ^ k -^v 2 , a (k) 2 (v 1 , a (k)+v 2 , a (k)) 2 

< ^(||^, tt || + ||^ tt ||) 2 ||^a||L 
\\III\\a < K£ 2 \\v ha \\ Wa 

\\IVh a \\ a = exp. 

In estimating ||/|| 2 we have used the fact that n > 2a. In light of the fact that the a norm dominates the W a 
norm, the contribution from ||/|| a is dominated by the contributions from ||//|| a and ||i77|| Q . In particular 

\\g a h a \\ a < K (£ 2 \\vi, a \\ Wa + + \\v2, a \\)\\v 2 , a \\ a ) + exp) . (52) 

Observe that 

(Jg a )h a = J(g a h a ) + [J, h a ]g a (53) 

where [■, ■] is the Lie bracket. Thus ||(J<7a)/ia|| Q < if||<7 a fta||a + II [J, h a ]g a \\ a . Here we have used the fact that 
J is a bounded operator. We first examine the boundary terms. For any vector x, the bracket [J, h a ]x is 
localized at the lattice sites —1 and 0. Thus in the weighted norm we have 

\\[J,h a ]x\\ a <e- aa ^\\x\\^ <e- aaT "\\x\\ 

and in particular 

|| [J, h a ]g a \\ a < Ke~ aaT «(l + \\v\\) = exp. 

Here the ||i;|| term comes from terms (I) - (III) and the constant term comes from (/V). Thus (|46|) follows 
from ([52]) . 

Moreover, in light of the facts that Lu a (x,Jy) = —(y,x) and [ J, h a ]g a — exp we can use the Cauchy- 
Schwartz inequality to obtain N 2 ,a < £~ 1 ^ 2 \\gah a \\ a + exp. Similarly, we can use (|2"Tj) and (|2"5j) to see that 

N 2 , a =U a (€l, a ,Vl, a ) = (H'(u Ca ),V ha ) < Ce 3/2 1| V ha \\ Wa ■ 



Equation (|48[) now follows from (1521) . 

We now estimate Ni <a — Ni >a = —u>a{£i,a, (Jga)h a + L a Vi tOC )- Define 

g a := H'(u c+ +u c _+v)-H'(u c+ )-H'(u c _)-H"(u Ca )v 

= H'(u Ca +v a )~H'(u Ca )-H"(u Ca )v a (I) 
+H'(u c+ + u c _ + v) - H'(u Ca + v a ) - H'(u c _J - H"(u Ca )v- a (II) 

and 

g a :=H'(v 1 , a )-H"(u Ca )v 1 , a (III) 

so that (Jg a )h a + L a vi t0l = (Jg a )h a + (Jg)h- a . 

Observe that ||(7/)ft. a || a = exp and gh^ a = 0. Since multiplication by J and multiplication by h a 
commute up to error terms at the lattice sites and 1, the contribution of terms (II) and (III) to || (Jg a )h a + 
(Jg)h- a \\ a is localized at the lattice sites k G { — 1, 0, 1}. Thus this localized error term is exponentially small 
in the weighted norm. Upon taking the symplectic inner product with £i )Ct we see that the contribution of 
(II) and (III) to N ha - 7Vi jCt is exp. 
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Since H is smooth we see that (I)(k,t) < Kv a (k,t) 2 where K may be chosen to be independent of k, t, 
and £ and to be increasing in c a and ||va||, thus may be taken to be constant as we have assumed c a and 
||v Q || to be bounded above. In particular, 

((JUi,a) < Ke^kez^-^Kikf 

< ^(ll^allk + H^II^) 

In the second line we have used that K > 2a. In the last line we have used Young's inequality. We may 
once again use the fact that uj a (x,Jy) = —(y,x) together with the fact that (Jg a )h a = J{g a h a ) + exp to 
see that the contribution of term (I) to \Ni a — Ni, a \ is order £ 3 (|jwi. a ||iv Q + ||w2, q ||q) 2 . Combining these 
contributions we obtain (|47p 

This completes the proof. 

□ 



4 Stability and the Proof of Theorem 12.21 

This section is concerned with controlling the evolution of ||v(t)||> Lemma T4.ll controls Lemma T4. 2 1 

controls in terms of \c(t) — c(to)\ and ||v(£o)||, and Theorem [131 uses these bounds to control \c(t) — c(to)\ 

and thus ||u(i)||. 

We begin by controlling ||ui||w a . The following Lemma is a version of Lemma 9 in [llj where it was 
adapted from [T^]. The only difference is that we pay attention to the small e asymptotics of the constants 
and exponents. 

Lemma 4.1. Let a > be given. Let 6 : K — ► (0, 1) be C 2 smooth and assume that A := sup xeffi sup 0<(5<a 9 g, x ^^ < 
oo. Then there are constants Co, G\, and C2 which may be chosen independently of a such that whenever 
t : R — > K satisfies f > 1 + Cqo, and v is a solution of ^ with ||f(to)|| < C\a then for any t > to we have 

(C 2 /o) V>0', t)\v(j, t)\ 2 + I Y, s M-h s)\ 2 dt < (C a /o) £ <P(j, h)\v(j, t )\ 2 

where ip(x, t) = 9(a(x — r(i))), and ip'(x,t) = a9'(a(x — r(t))). In particular, choosing 9{x) = 1 + tanh(x), 
and a « e 2 we obtain 

2Ce- i U(t) 1,2 viit)\\ 2 + f hi{s)f Wa[s) ds < C £ - 4 || Vl (t )|| 2 (54) 

J to 

Remark. The choice a « e 2 is necessary for the result to be valid when f — 1 ~ e 2 . 
Proof. Sum by parts to obtain 

jGZ V_J J j'GZ 

Thus we may write the right hand side of the above as 

RHS = E ^(Pi + rjWi-PiTi-Mi ~ *i-i)-Pi<y\ri-i) - rj-i)W>j ~ 1>i-i) + (Vfa) - \r 2 )^ 

In light of the convexity of V and the conservation of the Hamiltonian, there is a constant C which may 
be chosen uniformly such that 

\V(r) - ir 2 | + \V'(r) - r\ < C\\v(t Q )\\r 2 < Gb^-r 2 for \\v(t )\\ < Cia 
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holds. Here we are using ||v(t)|| as a cheap bound for rj{t) and using the conservation and convexity of the 
Hamiltonian to replace ||i>(t)|| with ||u(io)||- We have also absorbed the constant C\ into the constant C. 
Now we have 

RHS < \ £ . £Z -(f - Ca){pj + + (1 + Ca)\p jrj ^ - 

< \ E, eZ -(r Ce*){p) + rjWj + (1 + Ca){^^\p)^) + Ca)^^^) 

< -\ Zjezir - 1 - o(l + A + C(l + aA))(p] + r|)^ 

In the second line we have used Young's inequality and multiplied and divided by ipj. In the third line we 
have used the mean value theorem and the definition of A. In the last line we have used the assumption 
f > 1 + Co a and have chosen Co large enough. 

Now integrate and use the fact that V(r) > r for r small to obtain the result. 

□ 

The following energy estimate is a consequence of the convexity of the Hamiltonian. 

Lemma 4.2. Let to and t\ be real numbers. Denote m = min{£o,£i} and M = max{io,£i}. Suppose that 
u = u c+ + u c _ + v solves ([2|) for for t G [to, M]. Suppose also that there are wave speeds and phases c(ti) and 
t(U) such that U} a (£i,a(T a (ti),Ca(U)), v a (U)) = for i = 0, 1. 

Then there are positive constants Si and K such that whenever < Si for i = 0, 1 then 



IK*i)ll 2 <# 



IHio)!! 2 + E £ \ c ^ ~ c ^o)\ + e- KQT ° ( * o) (l + IK(*o)ll) + e— + IM^H) 
«£{+-} 

(55) 



Proof. Since the Hamiltonian is convex in a neighborhood of u c+ + u c _ , there is a Si > 0, and constants K_ 
and K + such that 

^A'-|M| 2 < H(u c+ +u c _ +x)-H{u c+ + u c _) - H'{u c+ + u c _)x < K+\\x\\ 2 (56) 

whenever ||x|| < Si. Thus we may write 

iil_|K£i)|| 2 < [H(u c+ +u c _ +v) -H(u c+ +u c _)- < H'(u c+ +u c _),v>\ ti 

= [H(u c+ +u c _+v)-H(u c+ +u c _)-<H'(u c+ +u c _),v>\ to {I) 

+ H{u c+ +u c _+v)\ ti - H{u c+ +u c _+v)\ to {II) (57) 

+ H(u c+ +u c _)\ to - H(u c+ +«c_)| tl {HI) 

(H'{u c+ +u c _),v)\ to - (H'(u c+ +u c _),v)\ ti {IV) 

We immediately sec that ||(/)|| < i^||w(to)|| 2 from (j5"6")) and the fact that ||u|| < Si; we also see that 
{II) = because the Hamiltonian is conserved along solutions. To estimate (III) we write 

H{u c+ +U c _) = J2a€{ + ,~}( H ( U c+ + u c^) - H{u Ca ))h a (l) 

+ E ae {+,-}O ff ( u cJ -H(u a ))h a (m) 

+ T, ae {+.-} H «) (*«) 
+ Y, a e{+,-} H (u<x)h- a (iv) 
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where u a (k, t) := u Ca (t )(k—T a (t)). It is a consequence of the mean value theorem that || (i)|| < KJ2a£{+ -} \\ u c^ a h a \\ < 
K(e~ KT + + e KT -). From Lemma 9.1 in [7] we know that dH ^ a ^ ~ e, thus by the mean value theorem we see 
that ||(m)|| < KzJ2a£{+ -} \ c a(t) — c a (to)\. Since the Hamiltonian is translation invariant, we may replace 
(Hi) with H(u Ca ( to ^(k — c a (U))t)) which is constant in time, thus does not contribute to (III). Term (iv) is 
exponentially small because u a h a is exponentially small and H(0) = 0. Thus we obtain 

\\(IH)\\< e- KaT " (tl) +e- KaT « (to) +£|c Q (ti)-c Q (i )|. 
ae{+,-} 

To estimate (/V) we write 

(H'(u c+ +u c _),v) = J2 {(H'(u c+ + u c _) - H'(u c J)h a ,v a ) < K(e-^\\v+\\ + e KT -\\v.\\). 
«£{+,-} 

Here we have used the mean value theorem and the fact that (|21[) may be written as (H'(u Ca ),v a ) = 0. 
Summing the bounds I through IV we obtain 

Mto)\\ 2 + J2 £ l c «(*i) - c -(*o)| + e- KaT ^\l + \\v a (t )\\) + e -— fe)(i + || Va ( tl )||) 
«£{+-} 

as desired. □ 



Mh)\\ 2 <K 



We now study the semigroup associated with the linear variational problem about a solitary wave in the 
Toda lattice. 

Proposition 4.3. Let V be the Toda potential V(x) = e x — 1 — x. Fix c* > 1. Then there are positive 
constants K , b, and a such that any solution w of the linear equation 

d t w = JH"(u c )w (58) 

which also satisfies the orthogonality condition 

(J~ 1 d x ii c , 7 w) = (J~ 1 <9 c it c | c=c «,u;) = (59) 

necessarily satsifi.es the decay estimate 

||e a( - ct V(f)|| < Ke- b{t - toS >\\e<- ct ^w(tQ)\\ (60) 

Moreover the constant K can be chosen uniformly in c (and hence e) while the constants a and b satisfy 
a = 0(e) and b = 0(s 3 ) in the small £ regime for the scaling (|14p . 

Remark. Here the inner product in (|59p is the dual pairing between £_ a and £„. The Neumann series form 
for J -1 reflects this: (I — 5) _1 is the sum X)fc°=o ^ k which has norm 1 _^_ a as an operator from (. 2 _ a to itself. 
This result is for rightward moving waves. Of course, to examine leftward moving waves, one takes (•, •) to 
be the dual pairing between £ 2 and £ 2 _ a and defines J -1 = — Y^k=i S~ k ■ 

The estimates in Proposition ^. 3l are proven in [T5], with the exception of the fact that K is independent of 
c, which turns out to be crucial for our proof of Theorem l2.2l We establish this by studying the e-dependence 
of the Backhand transformations constructed in [13| . Details are provided in Appendix A. 2. 

We move now toward the main stability estimate. Our aim is to use the exponential estimate (|60j) to 
solve the equation (|23|) via the variation of constants formula. Note that equation (|59|) may be rewritten 
in our notation as w+(^(c5_, t+), w) = for i — 1,2. This differs from the orthogonality conditions (|2"Tj) 
and (|22[) that we impose on vi. a in three important ways. First, is evaluated at a fixed wavespeed c* 
rather than at a modulated wavespeed c a (t) and second, the condition u> a (£i tCC , V2,a) = is imposed rather 
than uj a (^i. a ,V2,a) = — w a (Ci,a, vi,a)- Third, we consider any potential V which satisfies (HI) which is more 
general than restricting to the Toda potential. More concretely, we have only proven that the estimate (f60|) 
holds for perturbations which are symplectically orthogonal to the tangent vectors of the wave state manifold 
for the Toda model and at a fixed reference wave speed c* . Our perturbation V2, a satisfies different conditions, 
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(|2ip and ([22]) , which impose symplectic orthogonality to the tangent vectors of the wave state manifold for 
the FPU model and at a modulated wave speed c a (t). 

Furthermore in Proposition 14.31 we have studied the semigroup for the Toda lattice linearized about a 
solitary wave while the linear part of our evolution equation generates a different semigroup - that of a more 
general FPU model linearized about a solitary wave. Thus the solitary wave profile u c which appears both 
in the orthogonality condition (|59p and the evolution equation Q58[) will differ between this proposition and 
our later analysis. 

The reason for these differences are that ([2"T]) and ([2"2"]l are essential for obtaining good bounds on the 
modulation equation while (|59p is the most natural condition (and the one used in [10] and subsequently in 
[T3]) for extending the KdV semigroup estimates of [5] to the FPU context. Moreover, it is useful to work 
with the Toda potential because the Backlund transformation constructed in [13| allows us to obtain uniform 
estimates in the small e regime for the constant in ([60]) . These differences contribute additional terms to the 
evolution equation for v(t) which we bound and regard as small inhomogeneous terms on the right hand side 
of ([23]) . This is addressed in Lemma l4~4l 

We now define a map which takes as an input V2, a satisfying (|21[) and (|22p and gives as its output a V2, a 
satisfying ([59]) . Given an initial time to, define £i, q (t) := d x u^ (t )(" — T ) an d £2,a( T ) := d c u^ (t )('~ T ) where 
the superscript T denotes that these are solitary waves for the Toda lattice. Given V2. a satisfying (|2ip and 
(|22l) we seek V2, a close to V2. a which satisfies the conditions ([59]) which we write as 

u> a (ii, a ,v 2 , a ) = i = l,2 

so that we may apply the semigroup estimates ([50)1 of Proposition 14.31 to control the evolution of v 2 ,a- 
To that end, we define a projection Q a onto the symplectically orthogonal space as follows: 

Q a V~V = = = = ^i, a = = ?2,a (olj 

Wa(€2,u,£l,a) ^0(^2,0,^,1, a) 2 W a (£l, a , £2, a ) 

We remark that the denominator u> a (^ a ,^2,a) is bounded below by a multiple of e as was shown in Lemma 
9.1 of and Lemma 2.1 in [S]. 
Define 

Lemma 4.4. Suppose thatv2, a satisfies (|2ip and (|22p and \c(t) — c(to)\ < Ce 3 . Then for s sufficiently small, 

11^2, a || a <C(\\Q a ^2, a \\a + \\vi,a\\w a ) (62) 

Moreover 

[L a ,Q a }=0 

Furthermore, there is a C which is independent of e such that ||Q a a;|| Q < C||.t|| q 

Proof. To see that the norm of Q a is bounded uniformly in e, apply to ([rTT| . We now establish (JHU). We 
estimate 

UJ a (^2,a,^l,a)(I - Qa)v2,a = W«(6,0! U2,a)^l,a (*) 



Observe that since the wave profiles u c and are both equal to the KdV wave profile to leading order in e, 
their difference and the derivatives of their difference is higher order in e. To quantify this use (|27p and the 
triangle inequality to obtain 

\\d k c di(u c -u T c )\\ a <C £ y^' 2k , (63) 



which is reminscent (f2"5)l . From the above, it follows that 

||fa,o - &,alU < \\d c u c{t ) - d c u c{to) \\ a + \\d c u c(to) - d c u T c(to) \\ a < C (e- b/2 \c{t) - c(t Q )\ +e 3 / 2 ) . 
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here we have used (|28| and the Mean Value Theorem to bound the first term and ([63]) to bound the second 
term. Similarly, we obtain 

llfi.a - < \\d T u c{t) - d T u c{to) \\ a + \\d T u c(to) - d T u T c[ta) \\ a < C (e 1/2 |c(t) - c(t )\ +e 9/2 ) . 

Use (|28|) together with the above and 

and 

U a (£,2,a, v 2,a) = l^>a(S,2,a ~ £,2,a, V2,a) 

which follow from the orthogonality conditions (|21|) and ([22]) to obtain 

W<(e- 3/2 \c(t)-c(t )\+e 5 / 2 )\\v2, a \\ a 

and 

(«) < Ce\\v lta \\ Wa + Cie-^cit) - c(t )| + £ 3 )\\v2, a \\ a 
Summing these estimates and dividing by w Q (^2, a , £i,a) ~ £j (which is known from P3), we obtain 

II (I - Q a )v2, a \\ < C\\v ha \\ Wa + C (V 5/2 |c(t ) - c(t)| + e 3/2 ) KalU 

After applying the triangle inequality and using the fact that e _5 / 2 |c(io) — c(t) | << 1 we obtain (|6l?)) as 
desired. 

We now show that Qq, commutes with L a . If the following computation H := Hxoda- Write Q Q w = 
u-OiWa(6,aiV)li,a-a2W a (li,a,«)Ci,a + Oiw(Ci,a)V)|2,Q where a! = — -^-^ — - and a 2 = "°| 2 '°f'° > - Note 
that ai and 02 are constant in t because £j iQ are evaluated at a fixed wave speed c* . 

Also recall that <9 t fi, Q = ^J-ff"(u co )£i,a and d t £ 2 ,a = ^JH"(u C0 )i 2 , a ~ 
One may compute 

Ag^ = 9 tW _ oiw a (f2,a,-£o«)fi,a + ^w a (|i, a , u)|i, Q - a 2 w Q (fi, Q , Z a w)|i jQ! + aia; a (fi, a , L a v)£ 2t0l 
-± JH "( Uco ) (v-Qv) - ^w a (| lia ,«)|i, a 

= ^JH'"(u Ca )v + L a V - OiWQ,(| 2 ,a,-&a«)ll,a ~ ^Wafe.a, £ a «)ll,a + aiW a (|l, a , ^ai^&.a 

Thus L a Q a v = Q a L a v as desired. □ 
Thus 

)v 2 .a 

= Oa ({Jg a )h a +£ a + {L a - L Q )w 2 ,a) (64) 
:= G Q 

Let 'J'q be the evolution semigroup associated to equation |58|) . Then solutions of (l64|) have the following 
representation via the variation of constants formula. 

Q a V 2 ,a(t) = ^ a {T a (t),T a (t ))Q a V2, a {t ) + / ^ a {T a (t) , T a (s))G a (s)ds (65) 

In light of the exponential estimates on ty a given by (|60p we axe now in a, position to control |j^2,o||a- 



24 



Theorem 4.5 (Stability Estimates). Assume (HI). For each rj > 0, there exist positive constants C* and <5* 
which may be chosen independent of e, with the following property: 
Suppose that u is a solution of (|2|) and 

u(t) = U c+ (t)(--T+(t))+U c _(t){- -T-(t))+v 1 (t)+v 2 (t) t G [t ,h] 

Suppose further that the solutions satisfy 

\c(t) - c(t )\ + \\v 2 (t)\\ + ||wi(t)|| + K+(i)ll + + K-WII- < Ce 3+ " for all t e [to.ti]. 

TTien /or all t £ [ioj^i] ^ e following inequalities necessarily hold: 

\c a {t) - c a (t Q )\ < C [e- 4 |M; )|| 2 + s-^MWl] + exp(t ) (66) 

and 

\\v(t)f < C [ £ - 3 ||t'i(to)!| 2 + \\v 2 (to)\\ 2 + \\v2,+ (t°)\\% + K-(*o)ll-] + exp(io) (67) 

and 

\\v2At)\\l < C [e- 3 ||^i(to)!| 2 + \\viAt)\\w a(t) + e-^-^WviMWl + e~ 2U ] . (68) 

Proof. We use the variation of constants formula (f6"5| to solve (|6"4"|) . From ([5(7)1 and the smallness assumptions 
in the statement of the theorem it follows that j-yl < Ce 7 ^ 2+V for t <G [to, t±\. This together with the smallness 
assumption on the variation in c yields f a (t) > c a {tQ)-\-c a {t)—c a {to)+^ a {t) > 1+Ca. Thus T a (t)—r a (s) > t—s 
whenever to < s < t < ti, and in addition the use of (|54[) is justified. 
In light of the decay estimate (|60|) we have the estimate 



\\Q a v 2 , a {t)\U < Ce" *-'"' || Q a v 2 , a (*o ) 1 1 Q + C / e - a6 «- s > || G Q (s) 1 1 (69) 



For ease of notation we have replaced r Q (t) — r Q (s) with i — s in the exponents. This is justified by the remark 
just prior to (f6"9"|) . Use (|62[) as well as the fact that the norm of Q a is bounded uniformly in e to obtain 



||«2,aW||a<Ge- a6 ( t - to )||« 2 , a (t )||a+C||i;i, a (t)|| W r a +G / e~ ab ^ \\G a (s)\\ a ds . (70) 
In order to proceed we make the estimate 



to 



(71) 



||Ga(s)|| Q < C\\{Jg a (s))h a \\ a + C\\£ a (s)\\ a + C(\ (*o)| + |7a( s )l + £ 4 )ll w 2,a||Q 

< Ce 2 |Ka||w«, + C(||«i ia || + Ka||)||«2,a||a + ^~ 1/2 |ca| + e 5/2 |7a| 

+ + \c a (s) - c„ (t ) I + £ 4 )||v2,a||a + exp 

< Ce 2 ||ui,a,||w a + Ce\\viA\w a + C^ll^alla 

+G(||vi ja ,|| + \\v 2 , a \\ + \c a (s) - c a (t )\ + s 4 )\\v 2:a \\a + exp 

< Ce 2 \\vi. a \\w a +Ce s+, i\\v 2 . a \\ a + exp te[t ,*i] 

In the first line we have estimated Q a (L a — L a )v 2:a in (|64)) and also used the fact that \\Q a \\ is bounded 
uniformly in s. In the second line we have used equations (|46[) and (|25p . In the third line we have used 
equations (|4"9"|) and (p)U|) . In the fourth line we have used the smallness assumptions in the statement of the 
theorem. 

Substitute (fTTj) into ([7D|) to obtain 

||«2,a(t)|| Q < C e -foC*-*o)|j^ 2jCt ( to )|| Q + C||^ l!Ct (*)|| w ^Ce 2 // o e- fc < t - s )|K; 1 ^(s)|| w „ (s) rfs 

+s 3 +*if t t o e- b (- t -^\\v 2 , a (s)Lds + exp 
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Apply the integral form of Gronwall's inequality to obtain 



\v2, a (t)\U < Ce c (e 3+ ^-*- Ht - t0) y b [\\vUmw a{ t)+e 
+e 2 f?e- b (t-s)\\v l!a ( S )\\w a ds 



-b(t-t )\ 



"2,a(*o) 



-bt 



exp(to) 



< C[\Mt)\\ Wa{t) +e- b ^\\v 2 M\\ a 



(72) 



-hi 



exp(< ) 



+ £ 2J t t o e- b ^\\v 1 , a (s)\\w a ds 
Here we have used the fact that b = C(e 3 ) > Ce 3+V for e small enough. Use Young's inequality to obtain 
KaWHS < C-[|| Wlja (t)||^ (t) + e- 26 (*-*o)|| W2ja (t )||2 

(/** e- 6 < e — >||wi,<,(s)|| w -„(.)ds 

Now compute 



(73) 



-26* 



exp(t ) 



/to(/ t o e " fe(t " S) ll W ba(«)lk„W^) : 



* < J?(jie- 6 (*->*»j;e-««-)||^ ia W|| 



ds 



In the second line we have used the Cauchy-Schwartz inequality and changed the order of integration. 
Recall that b — 0(e 3 ) and substitute the above into (JT3J) to obtain 



\v2, a (t)\\l dt < C 



\vi, a (t)\\w a (t)dt + £ ;5 ||«2, Q (io)||a + e bt exp(t 



Apply Lemma and in particular (|5"4")) to see that 



\\v 2 At)\r a dt < C e- b |MM||^ (to) + z-l\v2, a (to)¥ a + exp(to) 



(74) 



(75) 



Integrate equation ([4U)) and use (|75|) and ([54")) to obtain ([55)) . Substitute into equation ([55]) to obtain 
(f67|) . To obtain (f68]) . apply the Cauchy-Schwartz inequality to the integral term in (f73|) and use (|54|) . This 
completes the proof. □ 

Remark. It might appear from the proof that the cross terms associated with u c _ a do not appear in the 
estimates for v a . All of these cross terms are exponentially small, and incorporated into the term exp. 

Theorem 12.21 may now be regarded as a corollary of Theorem 14.51 

Proof of Theorem \2.2[ Proposition 13.41 applies as long as ||v2.+|| + + |K'2,-||- + IK'il < Ce 5 ^ 2+V . Since this 
inequality is satisfied by hypothesis at t = to and we have continuous dependence on initial conditions, it 
follows that tubular coordinates remain valid on some interval [io,ti] with t\ > to- Thus we may apply 
Theorem 14.51 to obtain (p5|) . ([57| . and ([55)) for t € [iojii]- It is straightforward to sec that so long as 
IMMII < Ce 9/2+11 and ||u 2) a(*o)||a < Ce 3+r > with < T) < 1/2 then wc may take t x = oo. In particular JSTJ) 
holds for all t > to. This establishes (??). That c a converges follows from the fact that c a is intcgrablc. 

From ([50]) we see that to show that 7 — > 0, it suffices to show that ||t>i,a||w Q and ||u2,a||a each go to zero. 
To see that ||vi,a(i)||w a (t) — > wc compute 

j t \\viAt)\\w a( t) < HmWJH'M))^ + \r(t)\\\Mt)\\w a (t) < C 
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This, together with (|54[) and in particular the fact that ||wi ja (i)||w a is square-integrable imply that ||i>i, Q (t)|| w a (t) 
as desired. Similarly, 

jT||«2,a|la < 2aaf Q ||w2 iQ ||a + 2||« 2l a||a||9tW2,o||a- 

Since V2, a solves (|23p and (|68p holds, all of the terms on the right are bounded; this together with the fact that 
ll^alU is square-integrable implies that ||u2,a(t)||a ~~ * as t — > oo as desired. It remains only to establish 
(|10[) and (jTTJ) , both of which follow from (|6"8)) . This completes the proof. □ 
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A Uniform Bounds in the Small e Regime 

A.l Uniform Bounds in the Small e Regime for Higher Derivatives of the Wave 
Profile 

Proof of Lemma \3.1\ First we observe some facts about the weighted space H* with norm ||/(-)||_ff s = 
ll/OII*' + ||e a '/(-)|k-- For L G C{H s a ) wc estimate \\L^\\ H s = \\L^\\ H s + \\L a ^ a \\ H s where L a := e^her" 
and Va(-) = e a -ip{-). Thus 

\\L\U(H» a ) < \\L\\ C (Hs) + \\La\\c(H°)- (76) 

Also observe that whenever P is a pscudodiffcrcntial operator with symbol p, then ||P||£(.fp) < ||3?||i,°°(iR,c) 
for any s > 0. Furthermore, if we denote the symbol of P a := e a ' Pe~ a ' by p a then f> a (£) = p(S, + id). Thus 
to bound the operator norm of a pseudodifferential operator (as in (1)) in H% it suffices to bound its symbol 
uniformly in a strip |Im£| < a in the complex plane. A slight modification of Lemma 3.3 in [7] yields this 
bound. 

More explicitly, let p(£) = c 2 b ^ a ^(^/2) w i^ n c 2 = 1 + e 2 /3/12 Denote by the critical part of p given by 

contribution of residues. Friesecke and Pego compute £>*(£) = g^pg^ro^feg^ ) ) where a\ and (3\ are analytic 
functions. Lemma 3.2 in [7] establishes that 

C 

\p(0 - P*(0\ < -r— W for l Im £l < b* with K < k 
i + tr 

Here the symbol p has poles at ±zk and is analytic on the strip |Im£| < k. 

One may now define po(£) : — gi+ e -i as i n the statement of Lemma 3.3 in [7] and compute 

12«i(^ 2 ) Pi(e 2 ) 

p*(0-po(0 - 



(d/eY + 1 + e^^) ((C/e) 2 + l)((£/e) 2 + 1 + £ 2 /?i(e 2 )) 

In particular — Po(s£)l is bounded uniformly in a strip of width rj for any 77 < 1. In particular, we 

take 77 = 1/2. 

It now follows from the triangle inequality that |p(e£) ~Po( £ £)l is bounded uniformly on |Im£| < 1/2. 
Now observe that p e (£) — p°(£) = e 2 (p(e£) — Po( e £))- This proves the first assertion in the statement of the 
lemma. 

To prove the second assertion differentiate p°(£) = gj^g and p e (£) = ^ + n^^i^^T^ with respect to /? 
to obtain 

= (-l) fc fc!(£ 2 + /3)- ( *" V(fl and d k f(0 = {-l) k k\{(3 + - sinc 2 ( £ £/2)))- (fc - V(£). 

In light of this computation, the fact that — sinc 2 (e£/2)) = £ 2 + C(e 2 ), and the fact that we have proven 
the first assertion in the statement of the lemma, the second assertion now follows. 
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To prove the third assertion in the statement of the lemma, write iV( £ )(x) = N^°\x) + \x 2 r\(e 2 x) and 
compute d(N £ (x) - N°(x)) = xr](e 2 x) + \e 2 x 2 -q' {e 2 x) and d 2 {N e (x) - N°(x)) = r](e 2 x) + 2e 2 xi]'(e 2 x) + 
\e^x 2 r]"{e 2 x). Also compute N°(x) - N°(y) = \{x + y)(x - y), dN°(x) - dN°(y) = x - y and d 2 N°(x) - 
d 2 N°(y) = 0. Now use the triangle inequality together with the fact that rj is smooth with 7/(0) = and the 
fact that to bound the H s norm of a multiplication operator it suffices to bound the multiplier to conclude 
that 

\\d k N e (x) - d k N a (y)\\ H i < \\x - y\\ H i + Ke 2 uniformly for || x \\ H g + <2 with k = 0,1,2 

as desired. 

To prove the fourth assertion in the statement of the lemma, let P a denote the pseudodiffcrcntial operator 
with symbol p a (Q = P°{£ + io>) = ^ + -^ +/3 - We claim that 

e a '(I-P°dN°(cf> ))- 1 e- a - = {I-P a dN\^))-\ (77) 

To prove the claim, first observe that for any operator L with 1 £ p(L) we have e a '(I — L)~ 1 e~ a ' = (I — La)^ 1 
where L a = e^Le~ a \ Thus it suffices to show that e a ' P°dN((j)p)e' a ' = e a ' P e~ a - dN(<f>p), i.e. that the 
operator given by multiplication with e~ ax commutes with dN ((f>p). But dN°(4>p) is just multiplication by 
<j)p and multiplication operators commute, so the claim is established. 

In light of ([76]) is suffices to show that Q a := (I — PadN^^p))" 1 is bounded uniformly in C(E S ). Write 

Q a = (i- (i- p°a7v°(0^))- 1 (p Q - p^dN^y 1 (i - p^N ^))- 1 

and use the fact that the symbol p° is analytic in a strip |Imz| < b, to see (for example from the Neumann 
series) that the function a i— > ||Qa||£(i? 1 ) is continuous on an interval of the form [0, a max ], hence achieves its 
maximum. Thus j|Q a || is bounded uniformly for a < a max , as desired. This proves the fourth assertion in 
the lemma and thus completes the proof of the lemma. □ 



A. 2 Uniform Bounds in the Small e Regime for the Constant in the Toda Semi- 
group Decay Estimate 

To prove Proposition ^. 31 we rely heavily on [13], which itself makes use of [10]. The main result of [13] is ([60]) . 
in fact all of the claims of Proposition 14.31 are proven in [T3] except the claims that a = 0(e) and b = 0(e 3 ) 
which arc proven in [7] and |10j respectively and the claim that K may be chosen uniformly in c, which we 
prove in this paper. A refinement of the proof used in [13j yields the result. The method of proof is as follows. 
Lemma 3 of [13] shows that the linear evolution equation 

w t = JH"(0)w (78) 

is exponentially stable in weighted spaces, i.e. admits semigroup estimates of the form (|60[) . Proposition 7 
in [T3] constructs a Backhmd transformation which conjugates the flow of ([55]) on the space of exponentially 
localized profiles which satisfy ([59]) with that of (|78|) on the uniform space I 2 x I 2 . The important observation 
that it suffices to study the Backlund transformation at t = is made in Corollary 8. Finally, in Corollary 
10 of [13] it is shown that the Backlund transformation and its inverse are bounded, thus the estimates ([60]) 
which are valid for (|78j) are necessarily also valid (with some constant K(c) which may now depend on c) 
for the equation ([55)) . With this in mind, to prove Proposition 14.31 it suffices to show that the Backlund 
transformation and its inverse are bounded uniformly for c > 1. 

Let k = k(c) be the unique root of sinliK = nc, let Q c (n,t) = log C o°h^K(n^~ct+i)) ^ c ^ nc Toda soliton 
with speed c and let A : £ 2 — > £ 2 be the multiplication operator e^'' ), given more explicitly by [Ax] n = 
-^^^^ijj x n- Because we work in slightly different coordinates than [T3], the Backlund transformation that 

we study is ^ c given in the notation of [13] by fy c = A$ c (0)A _1 and more explicitly by 4" c (r, p) = (r' ,p') and 
^/~ 1 (r',p') = (r,p) where 

CJ-V =p + CS~ 1 r ( Cb- 1 r = C&- l r'-p' 

and \ (79) 
p' = C5- 1 r-C5- 1 r { p = CS^r' - CS^r 
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Here 

S:=(S-I), C:=A-SA-\ C := A- A~ 1 S~ 1 , C := A - SA'^S' 1 , and C = A - A' 1 . (80) 

The coordinates (r, p) evolve according to the Toda model linearized about a solitary wave with speed c and 
must satisfy the orthogonality condition (|59p while the coordinates (r',p') evolve according to the Toda model 
linearized about the solution and may live anywhere in £ 2 (Z 2 ). 

Equation (|T9[) is exactly equation (29) in [13], rewritten in (r,p) coordinates. Note that ^ c does depend 
on c. Other than 5, each of the operators defined in (|8"0"|) depends on A which is defined in terms of k, which 
in turn is defined in terms of c. 

Proposition 14.31 may now be regarded as a corollary of the following lemma, together with the fact that 
= 1 _l- a , that k = 0(e) and that a = 0(e). 



already been bounded. To bound the last term note [(A 1 S — SA 1 )x] n — q,l+ ^ q Q " q " +2 = o(k). This 



Lemma A.l. Let 8, C C, C, and C be defined by (|80[) . Then there is a constant K, independent of c such 
that the following hold 

1. \\C\\ c{q} + ||C|U(*j) < Kk. 

2. WCS^Wc^j + \\CS- 1 \\ C (ii) < K. 

Suppose in addition that r, p, r' , and p' satisfy equations (|79[) with (r,p) satisfying (|59p . Then 

3. Wr'Wez < K\\p + 06-^2. 

4. \\r\\ el < K'WCS-'r' - p'\\ el . 

- 2 _ 2 

Proof of Lemma \A.l[ We bound each expression in turn. Let q n — cosh(riK). Then [Ca;] n = q l - ~ 
±2sinhK w ±2k. Similarly [Cx] n = q "~^ 2 ~ ±2sinh/t w ±2«. We write C5- 1 = C*^ 1 + A~ 1 S~ 1 to 
bound C5- 1 . Also write C^ 1 = C + A~ 1 S~ 1 + (A~ 1 S - SA'^S' 1 . The first two of these terms have 
already been bounded. To bound the 
completes the proof of 1. and 2. above. 

To establish 3 and 4 we solve the equations C8~ 1 x = y and C5~ 1 x = y for x in terms of y. This is 
not always possible because C has a one-dimensional kernel and its formal adjoint C has a one-dimensional 
cokernel. This will manifest in our expressions for x as either a consistency condition that x must satisfy 
or the presence of a free parameter in the solution for x. However, it is known that the map ^ c defined by 
(|79|) is an isomorphism from the subspace of -^(Z 2 ) 011 which (T59[) holds to £ 2 (Z 2 ). Thus in the context of 
conditions 3. and 4., the consistency equation will always hold and the free parameter will be fixed by (I59p . 

We now invert C8~ x x = y. The equation Cz = y is nothing more than a non-autonomous first order 
linear difference equation and thus may be solved (e.g. with a discrete integrating factor) to obtain q 2 ^ +1 ZM — 

q'm+i z m = ^2ji=m <? q +1 Vn which is valid for any integers to, M. Taking either M — > oo OR m — ► — oo we 

obtain the expression Z). = — ~Y^ = ^ ^^ryn = S«=-oo Y^Vn- The fact that the range of C has codimension 

1 manifests in the condition that Cz = y may only be solved when X^nez ^If^Vn = 0- One may show that 
this condition is satisfied for y = p + C8~ 1 r so long as (r,p) satisfy ([ST))) . Applying 5 to the result we see that 

3 2 2 00 3 2 2 k — 1 3 

. Qk+l ~ Ik tn+1 qk+l . qk ~ Qk+l Qn+1 / i\ 
Xk = — —Vk + — > Vn = Vk + — > Vn- (81) 

One may now choose the first representation of x for k > 0, the second for k < 0, regard each as a convolution 
by changing the index n to k — n, estimate that the coefficients in the sums in ([81]) are bounded by Cne~ 2K , 
and finally apply the Hausdorff- Young inequality for convolutions to see that 

\\xWt* < C(l + «||e-( 2 "-°)-||^)||y||^ < C||y||*2. 

In the last inequality we have used that a < 2k. This establishes assertion 3. The proof of 4. is similar. Wc 
may solve Cz = y to obtain 



Zk 



qk+iqk+2 , v^fc^ 1 1k+iqk + 2 h ~> (\ 



q k +iq k +2 _ y-l q k+} q k+ 2 fc<Q 
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Here zq appears as a free parameter which is fixed by the orthogonality condition (|59p in our case, i.e. when 

z = <5 _1 r and X^nez q +1 Un = f° r V = P+ C5 r. One may now prove assertion 4 using ideas similar to the 
above. 

□ 

A. 3 Uniform Bounds in the Small e Regime for the Neighborhood on which 
Tubular Coordinates are Valid 

Proof of Proposition \3.4\ Let To be given as in Lemma 13.31 Assume the conditions 

(i) \c a -c* a \ < Ce 2+ v (ii)aT a >T Q 

and define 

Q := {(T + ,T_,c+,c_,u,7i) e R 4 x £ 2 {Z,M. 2 ) x f a (Z, R 2 ) n £ 2 _ (Z, R 2 ) | (i) and (n) hold}. 

Define F : Q -> R 4 by F(t,c,u,u) = col F 2+ , Fi_, F 2 _) with F 1)Q = w a (£i, a , (w - and T 2 , Q = 

Wa(^2,a(w— u)h a ). Here is evaluated at (r a , c a ) and u is evaluated at (r + , r_, c+, c_), which we abbreviate 
as (t, c) . Observe that if we regard u as the solution of © under study and v\ as another exact solution of 
{2j intended to capture the part of the perturbation which is not exponentially localized, then upon denoting 

i>2 := u — u, v\ := u — u, and Vi >a = Uj/ia for i = 1, 2 

then «2,q satisfies the orthogonality conditions (f2Tj) and (f22|) for a e { + , — } if and only if F(t,c, u, u) = 0. 
We remark that it is natural to proceed by applying the implicit function theorem to find the zero set of F 
near (r*, c* , u, u). In fact, this has been done in [IT] and [8]. However, small e asymptotics are not of interest 
there, and the naive estimates from the implicit function theorem on the size of the neighborhood on which 
these coordinates are valid are not sufficient for our purposes. Rather, we construct an explicit contraction 
mapping and exploit the structure of F to show that the coordinates r, c, and v 2 are defined so long as (|35|) 
holds. 

Before we proceed it is convenient to establish the small e asymptotics for F and its derivatives. Let A 
be the matrix studied in Lemma 13.31 and let 

c = ( uj a (d T ^ a ,(u-u)h a ) -u} a (d c ^ a ,(u-u)h a ) \ nd c = ( C+ \ go that 

a \ U a (d T &,a, (U -U)h a ) UJ a {d c £,2,a, (u - u)h a ) J \ C_ J 

D( TjC )F(t, c,u,u) = A + C + exp. Use (|28p to see that when 

||«-fi||+ \\(u-u)h a \\ a <A'e 5 / 2+ " 77 > 0, (82) 

ae{+,-} 

which corresponds to (|35p . the terms in C are dominated by the terms in A (except for the upper left 
term, whose counterpart in A is exponentially small). Thus so long as (I82p holds, it follows that for e suf- 
ficiently small Dr TjC -\F is invertible. Moreover, to leading order in e, the off-diagonal blocks of (Z?( T:C )T) _1 

are exponentially small and the diagonal blocks have the structure ( i £ 2 +»f ) wn i cn ^ s similar to (|33[) . 
Similarly, one can compute D 2 Fi a = ( «2^V* ai'r^ 1 ' 1 " I anc ^ D 2 F 2 a = ( ajj^,™ a^'iT^ 2 '" I where 

V "T,c-^l.a °cTl,a / V C' Tc i'2,a <J c ^2,a J 

d 2 Fi. a = uj a (d 2 £,i, a , (u - u)h a ) + u a (d T £ l!a , £ 1<a ), <9 2 c Ti iQ = w Q (<9 2 rc) £i iQ , (it - u)/i Q ) - w Q (<9 T £i, Q , £ 2 ,a), 
9 2 T 1;Q = u> a (d 2 £ lt0l , (u - u)h a ) + w a (d c €i >a ,€2,a) + w a (£i, Q , <9 2 £ 2 , Q ) and d 2 F 2: a = uj a (d 2 £ 2 , a , (u - u)h a ) - 

^ a {dr^2,a,^l,ah a )-^ a (^2,a,9 T ^l, a ), d 2 T c) F 2 ^ a = W a (<9 2 ^ c) £ 2;Q , (u-u)h a )-U a (d T £ 2 , a , ^2,ah a )-U} a (d c ^ a , £l 1<x h a )- 

0J a (^2,a,9 c ^i ta ) and <9 2 T 2iQ = uj a (d 2 £ 2 , ai (u - u)h a ) + w Q (<9 c £ 2iQ , £ 2 ,a)- 
Use (|28p . (|82p. and p9p to obtain the leading order expressions 

£ 2 ^i,a ~ ( ^ f-i ) D 2 F 2 , a ~ ( ^4 ) . (83) 

Note that to control the terms involving u> a (-, (u — u)h a ) we have used the fact that the first term in 
the symplectic product contains a factor of £i iQ = -£-JH'{u Ca ). The presence of J allows us to replace 
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the symplectic inner product with a Euclidean one and then use the regular Euclidean Cauchy- Schwartz 
inequality rather than (|19|) . This is important because u — u need not lie in the weighted space. 

As in the proof of the implicit function theorem we set up a contraction mapping. It will be convenient 
to let A := D^ t c )F(t* , c*, u, u), to let x denote the variable (r, c), and to fix u and u and to denote F(x, u, u) 
simply as F(x). Define G(x) := A^ 1 (Ax — F(x)) so that fixed points of G correspond to zeros of F. Write 

(F(x) ~ F(y) - A(x - y)) = [ £ (F'(tx + (1 - t)y) - F'(x*)) dt] (x - y) 

= (Jo 1 (io 1 F "( stx + s (! - t)y) ds ) ( tx + (i - t)y - x *) dt > ( x - v))- 



We renorm M 4 by |(t + , t_, c+, c_)| r := sJ £3 ( T l + T -) + £ ~ 3 ( c + + c -)- 

Restrict attention to the neighborhood of a;* on which |(t, c) — (r*, c*)\ r < e 1+2v i.e. on which 

\t-t*\ < e- 1+71 and |c-c*|<£ 2+ " (84) 

hold. Denote the r and c components of x — y by # T and 8 C respectively. Then using and (f8"4")) we have 



G(x) - G(y) 



£ 4+ "6> T + e 1+ 'W c \ f e'W T + £ 3 +"6> c 



Note that we have absorbed the off-diagonal terms which are exponentially small. Thus 

\G{x) - G{y)\ 2 r ~ £ 3+2l? ^ + e -3+2^2 = ^1(^,^)13. (85) 



£ 4 £ 1 \ / £ 3 / 2 ||u— till N f p' 1 



Moreover G(0) ~ I _ 1 c 2+j] ) [ 3/2 11 / ~ -\u 11 ) ~ ( 3 +'f ) wnere m the last relation we have 

used (52). Thus |G(0)| r - e 3 / 2+v . It now follows from the triangle inequality and (|85|) that for £ small enough 
G maps a neighborhood in ]R 4 of the form (|8~4"|) to itself and is moreover a contraction on this neighborhood. 
In particular G has a unique fixed point x — (r, c) in this neighborhood. Moreover, as is standard in implicit 
function theorems, we can let u and u vary in the defintion of G and the fixed point (r, c) = (f(u, u), c(u, u)) 
is smooth in u and it by the uniform contraction principle. 

It therefore follows that these functions r Q = f a (u, u) and c a = c a (u, u) map a neighborhood of (u(r* , c*), w(') 
to a neighborhood of (r*, c*) such that F(t, c, u, u) = if and only if c a = Cq,(u, u) and r Q = r a (M, u). More- 
over, the implicitly defined functions are unique and as smooth as F, i.e. C°° . Thus upon defining €2(11, u) := 
u — u(f (u, u), c(u, c)) we obtain a homeomorphism between a neighborhood U of (u(r*, c*), u(t* , c*)) to a 
neighborhood V of (r*,c*,0). 

Recall that r* and c* are arbitrary, thus for each r and c satisfying (i) and (m) we have a homeomorphism 
between U(t,c) and V(r, c). We now show that these local homeomorphisms may be patched together to 
produce a global homeomorphism betwenn U* and V*. 

To that end, suppose that 

u(t, c) + v 2 = u(f, c) + v 2 

so that 

u(t, c) — u(f, c) + u(r, c) — u(f, c) — v 2 ~ v 2 
by the mean value theorem there are numbers t\ between r a and f a and cj, between c a and c Q such that 

6 (r| , c+ ) (t+ - f + ) + 6 (rl , c_ ) ( T _ - f_ ) + & (f + , c\ ) (c+ - c+ ) + 6 (f _ , cl ) (c_ - c_ ) = v 2 - v 2 

In light of (J25J) we see that so long as we force H^MU + II^MU < Ce 3/2+2r ' it follows that \t—t \ < Ce~ 1+2ri 
and \c — c\ < Ce 2+2n . In particular, |r — f | and |c — c| are sufficiently small for the local inverses to hold. □ 
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